Preface


These notes are being written for an introductory honors calculus class, Math 
1551, at LSU in the Fall of 2011. 

The approach is quite different from that of standard calculus texts. (In 
fact if I had to choose a subtitle for these notes, it would be ‘An Anti- 
calculus-text Book’.) We use natural, but occasionally unusual, definitions 
for basic concepts such as limits and tangents. We also avoid several stranger 
aspects of the universe of calculus texts, such as counterintuitive notions of 
what counts as ‘local maximum’ or obsessing over ‘convex up/down’, and 
stay with practice that is consistent with the way mathematicians actually 
work. For most topics we show how to work with the method first and then 
go deeper into proofs and finer points. We prove several results in sharper 
formulations than seen in calculus texts. Among drastic departures from the
standard approaches, we work with the extended real line $\mathbb{R}^* = \mathbb{R} \cup \{-\infty, \infty\}$,
and define limits in such a way that no special exceptions need to be made
for limits involving $\pm\infty$. We follow a consistent strategy of using suprema
and infima, which form a running theme through the historical development 
of the real line and calculus. An entire chapter is devoted to convexity. 

For various corrections and comments I thank Justin Katz.

Introduction


There are two fundamental notions that led to the development of calculus 
historically: (i) the measurement of areas of curved regions, and (ii) the study 
of tangents to curves. These apparently disconnected themes, formalized 
in integral calculus and differential calculus, respectively, come together in 
the fundamental theorem of calculus, which makes the subject so useful and
powerful. 

Lengths of line segments are measured by comparing to a chosen ‘unit’ 
of length. For areas the natural extension would be to measure the area of 
a region by counting the number of unit squares (squares with unit-length 
sides) needed to cover the region exactly. This works very well for a rectangle. 
Clearly a rectangle whose sides are of length 3 units and 4 units is covered 
exactly by a 3 x 4 grid of 12 unit squares. This leads naturally to 


area of a rectangle = product of the lengths of its sides. 


It requires only a few natural and simple steps to compute the area of a 
triangle (realize it as made up of two halves of rectangles) and more generally 
the area of a polygonal region. 

This strategy fails when we think of a curved region, such as a disk. 
There is surely no obvious way to cover a disk exactly with a finite number 
of squares or pieces of squares. (Whether such slicing and reassembly of 
regions is really possible, and in what sense, is a truly difficult problem.) 
The solution to this problem for specific curved regions was already known 
in the era of Greek mathematics. Consider non-overlapping squares lying 
inside a disk. Surely the area of the disk is at least as large as the sum of 
the areas of such squares: 


areas of polygonal regions inside disk $\le$ area of disk.


On the other hand, if we cover the disk with squares, which spill over to the
outside of the disk, and add up the areas of these squares we obtain an
overestimate of the area of the disk:

area of disk $\le$ areas of polygonal regions covering disk.

Thus, the area of the disk ought to be

the unique value that lies between these overestimates and the
underestimates.

This idea, of pinning down a value by realizing it as being squeezed in
between overestimates and underestimates, is an enormously powerful idea,
running all through the foundations of calculus. We will use this idea persis- 
tently in developing the basic notions of both integral calculus and differential 
calculus. 

Returning to the disk, it turns out that there is indeed such a unique 
value lying between the underestimates and the overestimates. The area of 
the disk scales up by a factor of 9 if the radius is scaled up by a factor of 3. 
Indeed, the ratio of the area of the disk to that of the square on the radius 
is the fundamental constant denoted

$\pi$.

The simplest underestimate for $\pi$ is obtained by inscribing a square in the
unit circle so that the diagonal of the square is a diameter of the circle; each
side of this square has length $\sqrt{2}$ and so its area is 2. Next, drawing a square
with width given by the diameter of the circle gives an overestimate for the
area: $2^2 = 4$. Thus

$2 < \pi < 4$.


Archimedes, working with a 96-sided polygon, obtained the estimates:

$3 + \frac{10}{71} < \pi < 3 + \frac{1}{7}$.

Aside from such estimates a crucial point is that the overestimates and un- 
derestimates can be made ‘arbitrarily’ close to each other; more precisely, 
there can only be a unique number lying between the overestimates and un- 
derestimates. 
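
The squeezing of $\pi$ between underestimates and overestimates can be tried out numerically. The following is a minimal sketch, not Archimedes' own computation: it assumes we compare semi-perimeters (rather than areas) of regular polygons inscribed in and circumscribed about a unit circle, starting from hexagons and doubling the number of sides. The function name `archimedes_bounds` is ours.

```python
import math

def archimedes_bounds(doublings=4):
    """Return (lower, upper) bounds for pi from regular polygons.

    Starts with hexagons inscribed in and circumscribed about a unit
    circle and doubles the number of sides `doublings` times
    (4 doublings give 96 sides).
    """
    inner = 3.0                    # semi-perimeter of the inscribed hexagon
    outer = 2.0 * math.sqrt(3.0)   # semi-perimeter of the circumscribed hexagon
    for _ in range(doublings):
        # Doubling the sides: harmonic mean, then geometric mean.
        outer = 2.0 * inner * outer / (inner + outer)
        inner = math.sqrt(inner * outer)
    return inner, outer

lower, upper = archimedes_bounds(4)
print(lower, upper)
```

With more doublings the two bounds move closer together, which is the point of the discussion above: only one number can sit between all the underestimates and all the overestimates.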
The value of $\pi$ expressed as a decimal is:

$\pi = 3.1415926\ldots$

where the dots mean that the decimal continues endlessly. What this means
exactly is that $\pi$ is the least number greater than all the finite decimal
approximations listed through the decimal expansion. This expression, though
quite concrete, is frustrating in that there is no direct specification of, say,
what the 25-th decimal entry is. More informative are formulas such as
or the enormously efficient but mysterious Ramanujan formula


$\pi = \dfrac{9801}{2\sqrt{2}\,\displaystyle\sum_{n=0}^{\infty} \dfrac{(4n)!\,(1103 + 26390n)}{(n!)^4\, 396^{4n}}}$.


What exactly these formulas mean will become clear when we have studied 
limits and infinite series sums. 
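
Even before limits are available, one can get a feeling for the efficiency of Ramanujan's formula by cutting the infinite sum off after a few terms. The sketch below assumes the commonly quoted form of the series, with general term $(4n)!(1103 + 26390n)/((n!)^4\, 396^{4n})$; the function name `ramanujan_pi` is ours.

```python
import math

def ramanujan_pi(terms=1):
    """Approximate pi using the first `terms` terms of Ramanujan's series."""
    total = 0.0
    for n in range(terms):
        numerator = math.factorial(4 * n) * (1103 + 26390 * n)
        denominator = math.factorial(n) ** 4 * 396 ** (4 * n)
        total += numerator / denominator
    return 9801.0 / (2.0 * math.sqrt(2.0) * total)

print(ramanujan_pi(1))
print(ramanujan_pi(2))
```

Already the first term alone agrees with $\pi$ to several decimal places, and each further term adds several more correct digits (up to the limits of floating-point arithmetic).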

The number $\pi$, originating in geometry, appears in a vast array of contexts
in physics, chemistry, engineering and statistics. 

Archimedes was able to compute areas of more complex curved regions by 
very careful estimations, encoded in his method of exhaustion for computing 
areas. The great power of integral calculus is illustrated by the fact that it 
turns the genius of Archimedes’ method into an utterly routine calculation 
that a student of calculus can do in moments. 


Chapter 1 


Sets: Language and Notation 


Set theory provides the standard foundation for nearly all of mathematics. 
It is, however, an especially abstruse area in mathematics. In this chapter 
we will learn some language and notation from set theory that is widely used 
in mathematics. 


1.1 Sets and Elements 


A set, for practical purposes, is a collection of objects. These objects are 
called the elements of the set. For example, 


$\{-1, D, 5, 8\}$

is a set whose elements are $-1$, $D$, 5, and 8. This is a typical way of displaying
a set: list its elements, separated by commas, within the braces { and }.
Sometimes a set is best specified by describing its elements. For instance,

$\{x : x \text{ is a prime number less than } 10\}$

is the same as the set

$\{2, 3, 5, 7\}$.


Occasionally, we will dispense with formality and simply write the descriptive 
form of the set as 
{all primes less than 10}. 


The notation

$x \in y$

says that $x$ is an element of $y$. Thus,

$3 \in \{2,3,5,8\}$.

On the other hand 4 is not an element of $\{2,3,5,8\}$, and this is displayed as

$4 \notin \{2,3,5,8\}$.

Sets $x$ and $y$ are said to be equal, written

$x = y$

if every element of $x$ is an element of $y$ and every element of $y$ is an element
of $x$; in other words, sets are equal if they have the same elements.

Both $\{2,3,5,8\}$ and $\{8,5,8,2,3\}$ display exactly the same set. If we
happen to repeat elements in one display or if we display the elements in a 
different order it does not change the set. As another example, 


{3,5} = {5,3}. 
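
Membership and equality of finite sets can be experimented with directly. The sketch below uses Python's built-in `set` type, which, like the sets described here, ignores both the order of the display and any repetitions; the variable names are ours.

```python
# Two displays of the same set: order and repetition do not matter.
first_display = {2, 3, 5, 8}
second_display = {8, 5, 8, 2, 3}

print(first_display == second_display)  # equality of sets
print(3 in first_display)               # 3 is an element
print(4 not in first_display)           # 4 is not an element
print({3, 5} == {5, 3})
```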


1.2 Everything from nothing 


The empty set $\emptyset$ is the set that contains no elements at all:

$\emptyset = \{\}$. (1.1)

For the empty set any statement such as

$a \in \emptyset$

is false, for $\emptyset$ contains no element.
Notice that the set

$\{\emptyset\}$

is not empty: it contains one element, that being $\emptyset$. This is a bit confusing,
so think it over. Thus:

$\{\emptyset\} \ne \emptyset$ (1.2)


because the set on the left contains one element whereas the set on the right
doesn’t contain any element.

In fact, $\{\emptyset\}$ has a name. It is just the mathematical definition of 1:

$1 \stackrel{\text{def}}{=} \{\emptyset\}$. (1.3)

Having 1 and $\emptyset$ we can form another set:

$\{\emptyset, 1\}$.

This, of course, is 2:

$2 \stackrel{\text{def}}{=} \{\emptyset, 1\}$. (1.4)


In this way we obtain the numbers

$0, 1, 2, 3, \ldots$

which, together, form our first infinite set:

$\{0, 1, 2, 3, \ldots\}$.

Here we have identified 0 and the empty set, but conceptually one thinks of
0 as ‘the number of elements’ in the empty set rather than as $\emptyset$ itself.
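
The first few steps of this construction can be imitated with Python's immutable `frozenset`. This is only a sketch: the text shows 1 and 2 explicitly, and the general rule used below (each number is the previous one with itself added as a new element) is our assumption about how the pattern continues; the name `successor` is ours.

```python
def successor(n):
    """Return the set whose elements are those of n together with n itself."""
    return n | frozenset([n])

zero = frozenset()        # the empty set
one = successor(zero)     # {zero}
two = successor(one)      # {zero, one}
three = successor(two)    # {zero, one, two}

print(len(zero), len(one), len(two), len(three))
```

Note that `one` is not equal to `zero`: a set containing the empty set is not empty.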
Much more can be done. The negative numbers 


$-1, -2, -3, \ldots$

can also be constructed as sets, and then the rational numbers, such as
$-13/271$. In fact, virtually every object in mathematics is a set.

There is, of course, much more to numbers than simply names given to 
certain sets, but we will not pursue this direction further. It is also good to 
keep in mind that our notion of numbers, both for counting and for ordering 
(such as listing items as first, second, third, etc.) is ancient, whereas the 
formalizing of this notion within set theory is barely over a hundred years 
old. 


1.3 Subsets 


If x and y are sets, we say that x is a subset of y if every element of x is an 
element of y; we denote this by 


$x \subset y$.

For instance,

$\{1,2,5,6\} \subset \{0,5,6,4,1,2,3\}$.

Note that every set is a subset of itself:

$x \subset x$.

Here are a couple more simple observations:

• if $x \subset y$ and $y \subset x$ then $x = y$;

• if $x \subset y$ and $y \subset z$ then $x \subset z$.


Sometimes one gets confused between $x \in y$ and $x \subset y$. These are different
notions. For example, 


$3 \in \{1,3,5\}$

but 3 is not a subset of $\{1,3,5\}$. But here is a somewhat twisted example:
for the set

$a = \{1, \{1\}\}$

we have both

$\{1\} \in a$, and $\{1\} \subset a$.


But this is unusual. 
One can do strange things with the empty set, always using arguments 
by contradiction. Here is a starter instance of this: 


Proposition 1.3.1 The empty set is a subset of every set: 


$\emptyset \subset x$ for all sets $x$.


Proof. We argue by contradiction. Suppose $x$ is a set and $\emptyset$ is not a subset
of $x$. This would mean that $\emptyset$ contains some element that is not in $x$. But $\emptyset$
contains no element at all. Thus we have a contradiction, and so $\emptyset$ is in fact
a subset of $x$. QED
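
The same phenomenon shows up if we test the subset condition by machine. In the sketch below (the function name `is_subset` is ours), the condition "every element of x is an element of y" is checked one element at a time; for the empty set there is nothing to check, and so the test succeeds.

```python
def is_subset(x, y):
    """True when every element of x is an element of y."""
    return all(element in y for element in x)

empty = set()
print(is_subset(empty, {1, 2}))   # nothing to check, so this holds
print(is_subset({1, 2}, {0, 1, 2, 5}))
print(is_subset({3}, {1}))
```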

1.4 Union, Intersections, Complements


The union of sets x and y is the set obtained by pooling together their 
elements into one set. The union is denoted 


$x \cup y$.


For example,

$\{1, 3, 5\} \cup \{3, 5, 6\} = \{1, 3, 5, 6\}$.

One can do unions of any family of sets. For example,

$\{1, 3\} \cup \{2, 5\} \cup \{4, 6\} = \{1, 2, 3, 4, 5, 6\}$.


The intersection of sets x and y is the set containing the elements that 
are both in x and in y, and is denoted 


$x \cap y$.


For example,

$\{1,3,5,6\} \cap \{2,4,6,3,8\} = \{3,6\}$.


The intersection can be empty of course:

$\{1, 2, 4\} \cap \{3, 5\} = \emptyset$.


If the intersection of sets x and y is empty we say that x and y are disjoint. 
One can take intersections of more than two sets as well: 


$\{1,5,3,6\} \cap \{2,3,4\} \cap \{3,8,9\} = \{3\}$.


Sometimes we are working within one fixed big set $X$. Then the complement
of any given subset $A \subset X$ is the set of elements of $X$ not in $A$:

$A^c = \{p \in X : p \notin A\}$.
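
For finite sets these operations are available directly in Python. The sketch below uses the operators `|`, `&` and `-` of the built-in `set` type; the big set `X` and its subset `A` are hypothetical choices made only for illustration.

```python
x = {1, 3, 5, 6}
y = {2, 4, 6, 3, 8}

union = x | y                  # elements in x or in y
intersection = x & y           # elements in both x and y
three_way = {1, 5, 3, 6} & {2, 3, 4} & {3, 8, 9}

X = set(range(10))             # a fixed 'big set' (illustrative choice)
A = {0, 2, 4, 6, 8}            # a subset of X (illustrative choice)
complement = X - A             # elements of X not in A

print(union, intersection, three_way, complement)
print({1, 2}.isdisjoint({3, 4}))   # disjoint: empty intersection
```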


1.5 Integers and Rationals 
The numbers 0, 1, 2, 3, ... along with their negatives form the set $\mathbb{Z}$ of integers:

$\mathbb{Z} = \{0, 1, -1, 2, -2, 3, -3, \ldots\}$. (1.5)

Taking ratios of integers yields the rational numbers. Thus a rational
number can be expressed as

$p/q$,

where $p$ and $q$ are integers, with $q \ne 0$ of course. For example, $-34/15$ is
rational. The set of all rational numbers is denoted

$\mathbb{Q} = \{p/q : p, q \in \mathbb{Z},\ q \ne 0\}$. (1.6)


As we know there is a whole algebra for the rationals: they can be added, 
subtracted, multiplied, divided (not by zero). Moreover, there is an order
relation on $\mathbb{Q}$, telling us which of two given rationals is bigger.

For calculus we need the set $\mathbb{R}$ of all real numbers. This is a much larger
set than $\mathbb{Q}$, as it contains irrational numbers such as $\sqrt{2}$ and $\pi$, in addition
to all the rational numbers. 


1.6 Cartesian Products 


The displays {a,b} and {b, a} describe the same set, but sometimes we need 
to express a appearing first followed by b. For this purpose there is the notion 
of an ordered pair: 


(a, b). 


There are a number of ways to construct a set out of a and b, reflecting the 
distinction between them; a simple (though certainly not obvious) strategy 
is to define 


$(a,b) = \{\{a\}, \{a,b\}\}$. (1.7)


We can check that, with this definition, 
(a, b) = (c,d) if and only if a=c and b=d. 


Thus, for instance, 
$(1,3) \ne (3,1)$.
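
The defining property of ordered pairs can be checked by brute force on a small range of values. The sketch below encodes the pair $(a,b)$ as the set with elements $\{a\}$ and $\{a,b\}$, using Python's `frozenset`; the function name `ordered_pair` is ours.

```python
def ordered_pair(a, b):
    """Encode the ordered pair (a, b) as the set {{a}, {a, b}}."""
    return frozenset([frozenset([a]), frozenset([a, b])])

print(ordered_pair(1, 3) == ordered_pair(3, 1))   # order matters

# Check on a small range: pairs are equal exactly when entries match.
values = range(3)
consistent = all(
    (ordered_pair(a, b) == ordered_pair(c, d)) == (a == c and b == d)
    for a in values for b in values for c in values for d in values
)
print(consistent)
```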


Next, from sets A and B we can construct the set of all ordered pairs (a, b), 
drawing the first entry a from A, and the second entry b from B: 


$A \times B = \{(a,b) : a \in A,\ b \in B\}$. (1.8)

This is called the Cartesian product of $A$ with $B$. For example,

$\{2,5,6\} \times \{4,9\} = \{(2,4), (2,9), (5,4), (5,9), (6,4), (6,9)\}$.

The Cartesian product of a set $A$ with itself is denoted $A^2$:

$A^2 = A \times A$. (1.9)


Thus the plane, coordinatized by real numbers, can be modeled mathematically as

$\mathbb{R}^2 = \{(x,y) : x \in \mathbb{R},\ y \in \mathbb{R}\}$. (1.10)
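
For finite sets the Cartesian product can be listed out mechanically. In the sketch below Python tuples play the role of ordered pairs, and `itertools.product` from the standard library does the pairing; the function name `cartesian_product` and the sample sets are ours.

```python
from itertools import product

def cartesian_product(A, B):
    """Return the set of all pairs (a, b) with a in A and b in B."""
    return {(a, b) for a, b in product(A, B)}

A = {2, 5, 6}
B = {4, 9}
print(sorted(cartesian_product(A, B)))
print(len(cartesian_product(A, A)))   # number of elements of A x A
```

The number of pairs is the product of the numbers of elements of the two sets, which is one reason for the name.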


1.7 Mappings and Functions 
In calculus we work with functions specified by formulas such as

$y = x^3 + x^2 + 1$.


This relation is not read as simply an equality of two quantities $y$ and
$x^3 + x^2 + 1$, but rather as a procedure for computing one quantity from the value
of another:

given the value $x = 2$ we compute $y = 2^3 + 2^2 + 1 = 13$.


Thus what we have here is a prescription: an input value for $x$ leads to
an output value $y$. Of course, the letters $x$ and $y$ are in themselves of no
significance; the same function is specified by

$s = t^3 + t^2 + 1$.


Sometimes a function is specified not by a formula but by an explicit descrip- 
tion; for example, 


$1_{\text{prime}}(m) = \begin{cases} 1 & \text{if } m \text{ is a prime number;} \\ 0 & \text{if } m \text{ is not a prime number} \end{cases}$

specifies a ‘function of $m$’, where $m$ runs over the positive integers. For
example,

$1_{\text{prime}}(5) = 1$, and $1_{\text{prime}}(4) = 0$.

To formalize the notion of a function as producing an output value from
an input value drawn from some given set in set language, observe that the 
function is essentially known completely if we are given the value f(t) for 
every relevant t; this information can be encoded by providing the set 


{(t, f(t)) :t running over a given set of interest}. 


Note that to keep things unambiguous, f(t) should mean exactly one unique 
value, and not multiple values. Thus, for example, 


$y = \pm\sqrt{1 - x^2}$


is a relation containing meaningful information but we will not use the term
‘function’ for this. 

We can now turn to a formal definition that reflects this notion. 

A map, mapping, or function 


$f : A \to B$


is specified by a set $A$, called the domain of $f$, a set $B$, called the codomain
of $f$, and a set $\mathrm{Gr}(f)$ of ordered pairs $(a,b)$, with $a \in A$ and $b \in B$, such
that for each $a \in A$ there is a unique $b \in B$ for which $(a,b) \in \mathrm{Gr}(f)$. If
$(a,b) \in \mathrm{Gr}(f)$ we denote $b$ by $f(a)$:


$b = f(a)$ means $(a,b) \in \mathrm{Gr}(f)$.


The set Gr(f) is the graph of the mapping f. 
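
For finite sets, the condition in this definition can be tested mechanically. The sketch below (the function name `is_graph_of_function` and the sample sets are ours) checks that every pair lies in $A \times B$ and that each element of $A$ appears as a first entry exactly once.

```python
def is_graph_of_function(pairs, A, B):
    """True when `pairs` is the graph of a function from A to B."""
    # Every pair must have its first entry in A and second entry in B.
    if any(a not in A or b not in B for a, b in pairs):
        return False
    # Each a in A must occur as a first entry exactly once.
    return all(sum(1 for a, _ in pairs if a == x) == 1 for x in A)

A = {-2, -1, 0, 1, 2}
B = {0, 1, 4}
squares = {(a, a * a) for a in A}
two_valued = {(0, 0), (1, 1), (1, -1)}   # two values at the input 1

print(is_graph_of_function(squares, A, B))
print(is_graph_of_function(two_valued, {0, 1}, {-1, 0, 1}))
```

The second relation fails the test for the same reason that a two-valued relation is not called a function: the input 1 does not lead to exactly one output.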

The term function is used normally, instead of mapping, in calculus. 

In order to make progress we also need to use language shortcuts. For 
example, instead of saying the function, with domain and codomain the set 
of real numbers and having graph 


$\{(a, a^2) : a \in \mathbb{R}\}$,


we often simply write

the function $y = x^2$,


it being clear, unless otherwise spelled out, that $x$ runs over all real numbers.
Thus, when we say

the function $y = x$

we mean the function $x$ whose value at any $p$ is $p$ itself; for instance,

$x(3) = 3$, $x(-4.75) = -4.75$.


The graph of $y = x$ is displayed visually as a straight line:

Figure 1.1: Graph of $y = x$

The graph of $y = x^3 - x^2 - 2x + 1$ is displayed visually as

Figure 1.2: Graph of $y = x^3 - x^2 - 2x + 1$

Often in calculus, the codomain is clear from context (usually, the set 
R of all real numbers) and we identify a function with its graph, ignoring 
specification of the codomain. 

The range of a function f is the set of all values it takes: 


$\mathrm{Range}(f) = \{f(a) : a \in \text{domain of } f\}$.

For example, for the function

$y = x^2$ for all real numbers $x$,

the range is the set of all non-negative real numbers

$[0, \infty)$.

For the function $1_{\text{prime}}$ we considered before, the range is the set $\{0,1\}$:

$\mathrm{Range}(1_{\text{prime}}) = \{0, 1\}$.
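
A function given by a description rather than a formula is just as easy to compute with. The sketch below implements the prime indicator by trial division and collects the values a function takes on a finite part of its domain; the names `indicator_prime` and `range_on` are ours, and the finite domains are chosen only for illustration.

```python
def indicator_prime(m):
    """Return 1 if the positive integer m is prime, and 0 otherwise."""
    if m < 2:
        return 0
    d = 2
    while d * d <= m:
        if m % d == 0:
            return 0
        d += 1
    return 1

def range_on(f, domain):
    """Return the set of values f takes on the given finite domain."""
    return {f(a) for a in domain}

print(indicator_prime(5), indicator_prime(4))
print(range_on(indicator_prime, range(1, 51)))
print(range_on(lambda x: x * x, range(-3, 4)))
```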
Returning to our earlier example 
$y = \pm\sqrt{1 - x^2}$,

observe that this is equivalent to

$x^2 + y^2 = 1$.


This can be viewed as its graph:

$\{(x,y) \in \mathbb{R}^2 : x^2 + y^2 = 1\}$,

which is the circle of unit radius, centered at the origin $(0,0)$.

Figure 1.3: Graph of the circle $x^2 + y^2 = 1$.


1.8 Sequences


A function whose domain is the set 
N = {1,2,3,...} 


of positive integers is called a sequence. For example,

$f : \mathbb{N} \to \mathbb{R} : n \mapsto \frac{1}{n}$

is a sequence. It is conventional to use $n$ in place of $x$, and write $f_n$ in place
of $f(n)$. In the preceding example we may describe it as

the sequence $f_n = 1/n$,

or, more formally, as

the sequence $(f_n)_{n \ge 1}$ where $f_n = 1/n$,

or, most simply, as

the sequence $1/n$.


Here you have to understand from the context that $n$ runs over $\mathbb{N}$, and we
are, strictly speaking, talking about the function that associates $1/n$ to every
$n \in \mathbb{N}$. The way to think about the sequence is to think of it as a list of its
values:

$\frac{1}{1}, \frac{1}{2}, \frac{1}{3}, \frac{1}{4}, \ldots$

Sometimes there is no ‘formula’ for the n-th term; for example, 


the sequence $p_n$ = $n$-th prime number.


(There are infinitely many prime numbers and so this does indeed specify a 
sequence. ) 

In some contexts it is useful to take the domain of a sequence to include 
0 as well. For example, the factorials

$f_n = n!$ for all $n \in \{0,1,2,\ldots\}$

are specified by

$0! = 1$, $1! = 1$, $2! = 2 \cdot 1$, $3! = 3 \cdot 2 \cdot 1$, $\ldots$
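
Since a sequence is just a function of $n$, listing its first few values is a matter of evaluating that function at $n = 1, 2, 3, \ldots$ (or from $n = 0$ when the domain includes 0). The sketch below does this for the three sequences mentioned in this section; the helper names `first_terms`, `reciprocal` and `nth_prime` are ours, and exact fractions are used so that $1/n$ is not rounded.

```python
import math
from fractions import Fraction

def first_terms(f, start, how_many):
    """List the values f(start), f(start + 1), ... (how_many of them)."""
    return [f(n) for n in range(start, start + how_many)]

def reciprocal(n):
    """The sequence 1/n, as an exact fraction."""
    return Fraction(1, n)

def nth_prime(n):
    """The n-th prime number, found by trial division (n = 1 gives 2)."""
    found = 0
    candidate = 1
    while found < n:
        candidate += 1
        if all(candidate % d != 0 for d in range(2, math.isqrt(candidate) + 1)):
            found += 1
    return candidate

print(first_terms(reciprocal, 1, 4))
print(first_terms(nth_prime, 1, 5))
print(first_terms(math.factorial, 0, 5))   # domain starts at 0
```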

Chapter 2


The Extended Real Line 


In this chapter we extend the real line $\mathbb{R}$ by including a largest element $\infty$ and a
smallest element $-\infty$. Using these makes it possible to write many theorems
in a simpler way, without having a list of qualifiers of which situations need 
to be excluded. 


2.1 The Real Line 


The numbers 0,1,2,... arise from the notion of counting. From these it 
is possible to construct negative numbers —1,—2,... and then the rational 
numbers. 


Geometry leads us beyond the rationals and forces us to bring in other 
numbers. Consider a straight line $l$, and on it pick a special point $O$ and another
point $U$.

Figure 2.1: Real numbers as ratios of segments (a point $P$ on the line corresponds to the real number $OP/OU$)

Then for any point $P$ on the line $l$ we can think of the geometrical concept
of the ratio

$OP/OU$,

where we take this to be negative if $P$ is on the opposite side of $O$ from
$U$. Such ratios can be added and multiplied by using geometric constructions
(these geometric operations on segments were described in Euclid’s
Elements). Thus they form a system of numbers called the real numbers.
For example, if P is just the point U then the ratio 


OP/OU = OU/OU 


corresponds to the number 1. Similarly, we have points $P$ for which $OP/OU$
is a rational such as $-4/7$. But there are also points $P$ for which $OP/OU$
cannot be expressed as a ratio of integers.

For example, consider a right-angled triangle whose two legs have length
$OU$. Then the hypotenuse has, by Pythagoras’ theorem, ratio to $OU$
given by $\sqrt{2}$. It is a fact that $\sqrt{2}$ is not a rational number, in that there is
no rational number whose square is 2.

The rationals are dense in the real line: between any two distinct reals 
there lies a rational. The irrationals are also dense in the real line: between 
any two distinct reals lies an irrational. 


2.2 The Extended Real Line 


The extended real line is obtained by adjoining a largest element $\infty$, and a smallest
element $-\infty$, to the real line $\mathbb{R}$:

$\mathbb{R}^* = \mathbb{R} \cup \{-\infty, \infty\}$ (2.1)

Here $\infty$ and $-\infty$ are abstract elements. We extend the order relation to $\mathbb{R}^*$
by declaring that

$-\infty < x < \infty$ for all $x \in \mathbb{R}$ (2.2)

Much of our work will be on $\mathbb{R}^*$, instead of just $\mathbb{R}$.
We define addition on $\mathbb{R}^*$ as follows:

$x + \infty = \infty = \infty + x$ for all $x \in \mathbb{R}^*$ with $x > -\infty$ (2.3)
$y + (-\infty) = -\infty = (-\infty) + y$ for all $y \in \mathbb{R}^*$ with $y < \infty$. (2.4)

Note that

$\infty + (-\infty)$ is not defined,


which just means that there is no useful or consistent definition for it. 
For multiplication we set 


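$x \cdot \infty = \infty = \infty \cdot x$ for all $x \in \mathbb{R}^*$ with $x > 0$ (2.5)
$x \cdot (-\infty) = -\infty = (-\infty) \cdot x$ for all $x \in \mathbb{R}^*$ with $x > 0$, (2.6)

with the signs reversed when $x < 0$: then $x \cdot \infty = -\infty$ and $x \cdot (-\infty) = \infty$.

It is a bit dangerous to define $\infty \cdot 0$. However, playing it very carefully, it
turns out to be useful to set

$\infty \cdot 0 = 0 \cdot \infty = 0$
$(-\infty) \cdot 0 = 0 \cdot (-\infty) = 0$. (2.7)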

This convention is quite useful when we study integration theory, but it
should not be used in other parts of calculus, such as the study of limits.

A definition, aside from being an identification of a distinctive concept, is
meaningful and necessary only in so far as it is useful in formulating results.
For example, 0/0 is undefined not because somehow we have arbitrarily decided
not to define it, but because any definition for it would be a dead end, of no
use elsewhere in mathematics.

Some familiar algebraic facts are still valid in $\mathbb{R}^*$:

$x + y = y + x$, $x + (y + z) = (x + y) + z$, (2.8)

whenever either side of these equations exists (that is, we don’t have $\infty + (-\infty)$
appearing).
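
Floating-point arithmetic on a computer has its own versions of $\infty$ and $-\infty$, and it is instructive to compare them with the conventions above. In the sketch below, `math.inf` stands for $\infty$; the helper names `ext_add` and `ext_mul` are ours. Python's own arithmetic returns the special value "not a number" for both $\infty + (-\infty)$ and $\infty \cdot 0$, so the helpers are needed to reproduce the conventions of this section: the sum is refused, and the product with 0 is set to 0 as in (2.7).

```python
import math

INF = math.inf

def ext_add(x, y):
    """Addition on the extended real line; inf + (-inf) is refused."""
    if {x, y} == {INF, -INF}:
        raise ValueError("inf + (-inf) is not defined")
    return x + y

def ext_mul(x, y):
    """Multiplication on the extended real line, with 0 times anything = 0."""
    if x == 0 or y == 0:
        return 0.0     # the convention used for integration theory
    return x * y

print(ext_add(3, INF), ext_add(-INF, 7))
print(ext_mul(2, INF), ext_mul(-2, INF), ext_mul(INF, 0))
print(math.isnan(INF * 0))   # Python's built-in answer is 'not a number'
```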

Chapter 3


Suprema, Infima, Completeness 


In this chapter we examine a fundamental property of the real line that 
distinguishes it from the rationals and that makes much of calculus possible. 
This property is called completeness, and roughly it means that the real line 
has no ‘gaps’ or ‘holes’. 


3.1 Upper Bounds and Lower Bounds 


Consider a set $S \subset \mathbb{R}^*$.
A point $p \in \mathbb{R}^*$ is an upper bound of $S$ if it lies to the right of $S$:

$x \le p$ for all $x \in S$.


For example, for the set

$\{3, 4, 6\} \cup (8, 9]$

upper bounds are all numbers $> 9$. Note that 9 is also an upper bound.

For the entire real line $\mathbb{R}$ the only upper bound is $\infty$. If we had restricted
ourselves to working only with real numbers then $\mathbb{R}$ would not have an upper
bound.

Note that $\infty$ is an upper bound for every subset of $\mathbb{R}^*$.

Here is a mind twister for the empty set: 3 is an upper bound for $\emptyset$. As
usual, to prove this we argue by contradiction. Suppose 3 were not an upper
bound of $\emptyset$. This would mean that 3 is less than some $x \in \emptyset$:

$3 < x$ for some $x \in \emptyset$.

But the empty set has no element and so no such $x$ exists. Thus, 3 must
indeed be an upper bound for $\emptyset$.

Is there anything special about 3 in the preceding argument? Definitely
not! Thus:

every value in $[-\infty, \infty]$ is an upper bound of $\emptyset$.


Now we turn to the flip side of the concept of upper bound. A point
$p \in \mathbb{R}^*$ is a lower bound of $S$ if it lies to the left of $S$:

$p \le x$ for all $x \in S$.

Thus, for example, for the set

$(8, 9] \cup [11, \infty)$

all $p < 8$ are lower bounds. Note that 8 is also a lower bound.

The value $-\infty$ is a lower bound for every subset of $\mathbb{R}^*$.

Returning to the strange case of the empty set the same logical gymnastics
show that every value in $[-\infty, \infty]$ is a lower bound of $\emptyset$.


3.2 Sup and Inf: Completeness 


Consider any $S \subset \mathbb{R}^*$.
The smallest upper bound of $S$, that is the least upper bound of $S$, is
called the supremum of $S$ and denoted

$\sup S$.


For example,

$\sup\big((8, 9] \cup [11, \infty)\big) = \infty$.


The largest lower bound, that is the greatest lower bound, of $S$ is called
its infimum and denoted

$\inf S$.


With the example above we have

$\inf\big((8, 9] \cup [11, \infty)\big) = 8$.

If you think about the empty set, a strange thing happens. Recall that
every value in $\mathbb{R}^*$ is an upper bound of $\emptyset$ and so the least upper bound is
$-\infty$:

$\sup \emptyset = -\infty$.


Similarly,

$\inf \emptyset = \infty$.


Thus, the supremum of the empty set is actually less than the infimum!
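
For a finite set of numbers the supremum is simply the largest element and the infimum the smallest, and the convention for the empty set can be built in as a default value. The sketch below covers only finite sets (an assumption; intervals and other infinite sets need the theory of this chapter), and the function names `sup` and `inf` are ours.

```python
import math

def sup(S):
    """Supremum of a finite set of extended reals; -infinity when S is empty."""
    return max(S, default=-math.inf)

def inf(S):
    """Infimum of a finite set of extended reals; +infinity when S is empty."""
    return min(S, default=math.inf)

print(sup({3, 4, 6}), inf({3, 4, 6}))
print(sup(set()), inf(set()))          # the strange case of the empty set
print(sup(set()) < inf(set()))
```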
Here is a fundamental property of R*: 


Every subset of $\mathbb{R}^*$ has a supremum and an infimum. (3.1)


This is called the completeness of $\mathbb{R}^*$.
If we want to stay within just the set of real numbers, the statement is a
little bit more complicated so as to rule out all the infinities:

If $A \subset \mathbb{R}$ is not empty and has an upper bound then it has a supremum, (3.2)

and an analogous statement holds for the infimum.

The completeness of $\mathbb{R}$ is a crucial property. It does not hold for rationals:
roughly speaking if we draw $\mathbb{Q}$ on a line there will be lots and lots of holes,
for instance at the point where $\sqrt{2}$ would be. Many of the great useful
results of calculus would fail on $\mathbb{Q}$ just for this reason.

The completeness property can be taken to be an axiom in one approach
to the study of the real number system. But in another, more constructive,
approach, where $\mathbb{R}$ is constructed out of set theory, completeness is a
property that is proved as a fundamental theorem about $\mathbb{R}$.


3.3 More on Sup and Inf 


Consider a non-empty set $S \subset \mathbb{R}^*$. Pick some point $p$ in $S$. Then any lower
bound of $S$ is $\le p$ and every upper bound of $S$ is $\ge p$:

any lower bound of $S$ $\le p \le$ any upper bound.


In particular,

the greatest lower bound of $S$ $\le p \le$ the least upper bound.

Thus,

$\inf S \le p \le \sup S$ for all $p \in S$.


If $S$ contains just one point then the inf and sup coincide: for example,

$\inf\{3\} = 3 = \sup\{3\}$.

On the other hand

$\inf S < \sup S$ if $S$ contains more than one point. (3.3)

Now consider another situation. Consider sets

$B \subset A \subset \mathbb{R}^*$.


Thus everything in $B$ is also in $A$. Any upper bound of $A$ is $\ge$ all elements
of $A$ and hence is $\ge$ all elements of $B$. Thus:

every upper bound of $A$ is an upper bound of $B$.

In particular,

the least upper bound of $A$ is an upper bound of $B$.

In other words:

$\sup A$ is an upper bound of $B$.


So, of course,

the least upper bound of $B$ is $\le \sup A$.


Thus,

$\sup B \le \sup A$ if $B \subset A$. (3.4)


Picking a smaller set decreases the supremum, where smaller means that it
is contained in the larger set. (‘Decreases’ is in a loose sense here, as it may
happen that $\sup A$ is equal to $\sup B$.)

By a similar reasoning we have

$\inf A \le \inf B$ if $B \subset A$. (3.5)


Picking a smaller set increases the infimum, with qualifiers as before.


Chapter 4 


Neighborhoods, Open Sets and 
Closed Sets 


In this chapter we study some concepts that make precise the notion of
nearness of points in $\mathbb{R}^*$.


4.1 Intervals 


An interval in $\mathbb{R}^*$ is, geometrically, just a segment in the extended real line.
For example, the set of all points $x \in \mathbb{R}^*$ for which $1 < x < 2$ is an interval. More
officially, an interval $J$ is a non-empty subset of $\mathbb{R}^*$ with the property that
for any two points of $J$ all points between the two points also lie in $J$: if
$s, t \in J$, with $s < t$, and if $s < p < t$ then $p \in J$.

Let $J$ be an interval, $a$ its infimum and $b$ its supremum:

$a = \inf J$, and $b = \sup J$.


Consider any point $p$ strictly between $a$ and $b$. Since $a < p$, the point $p$ is
not a lower bound of $J$ and so there is a point $s \in J$ with $s < p$. Since $b > p$,
the point $p$ is not an upper bound of $J$, and so there is a point $t \in J$ with $p < t$.
Thus

$s < p < t$.


Since $s, t \in J$ it follows that $p$, being between $s$ and $t$, is also in $J$. The
endpoints $a$ and $b$ themselves might or might not be in $J$. Thus we have the
following possibilities for $J$:

$(a,b) = \{x \in \mathbb{R}^* : a < x < b\}$
$[a,b) = \{x \in \mathbb{R}^* : a \le x < b\}$
$(a,b] = \{x \in \mathbb{R}^* : a < x \le b\}$
$[a,b] = \{x \in \mathbb{R}^* : a \le x \le b\}$. (4.1)


An interval of the form $[a,b]$ is called a closed interval, and an interval of
the form $(a,b)$ is called an open interval. Thus a closed interval contains its
two endpoints, while an open interval contains neither endpoint.


4.2 Neighborhoods 


A neighborhood of a point $p \in \mathbb{R}$ is an interval of the form

$(p - \delta, p + \delta)$

where $\delta > 0$ is any positive real number. For example,

$(1.2, 2.8)$


is a neighborhood of 2.
A typical neighborhood of 0 is of the form

$(-\epsilon, \epsilon)$


for any positive real number $\epsilon$.
A neighborhood of $\infty$ in $\mathbb{R}^* = \mathbb{R} \cup \{-\infty, \infty\}$ is a ray of the form

$(t, \infty] = \{x \in \mathbb{R}^* : x > t\}$

with $t$ any real number. For example,

$(5, \infty]$


is a neighborhood of $\infty$.
A neighborhood of $-\infty$ in $\mathbb{R}^*$ is a ray of the form

$[-\infty, s) = \{x \in \mathbb{R}^* : x < s\}$


DRAFT Calculus Notes 11/17/2011 35 


where $s \in \mathbb{R}$. An example is

$[-\infty, 4)$.


Observe that if $U$ and $V$ are neighborhoods of $p$ then $U \cap V$ is also a
neighborhood of $p$. (In fact, for the type of neighborhoods we have been
working with, either $U$ contains $V$ as a subset or vice versa, and so $U \cap V$ is
just the smaller of the two neighborhoods.)

Observe also that if $N$ is a neighborhood of a point $p$, and if $q \in N$ then $q$
has a neighborhood lying entirely inside $N$. For example, the neighborhood
$(2,4)$ of 3 contains 2.5, and we can form the neighborhood $(2,3)$ of 2.5 lying
entirely inside $(2,4)$.

Here is a simple but fundamental observation: 


Distinct points of R* have disjoint neighborhoods. (4.2) 


This is called the Hausdorff property of R*. 
For example, 3 and 5 have the neighborhoods

$(2,4)$ and $(4.5, 5.5)$.

The points 2 and $\infty$ have disjoint neighborhoods, such as

$(-1,5)$ and $(12, \infty]$.
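
One systematic way of producing disjoint neighborhoods is sketched below. It is only one of many possible choices: for two real points we take $\delta$ to be half the distance between them, and when one of the points is infinite we use a neighborhood of radius 1 around the real point. A pair `(lo, hi)` stands for the neighborhood with those endpoints, where an endpoint equal to $\pm\infty$ is understood to be included; the function names are ours.

```python
import math

def disjoint_neighborhoods(p, q):
    """Return neighborhoods (lo, hi) of p and of q that do not overlap."""
    if p == q:
        raise ValueError("the points must be distinct")
    if p > q:
        near_q, near_p = disjoint_neighborhoods(q, p)
        return near_p, near_q
    # From here on p < q.
    if p == -math.inf and q == math.inf:
        return (-math.inf, 0.0), (0.0, math.inf)
    if p == -math.inf:
        return (-math.inf, q - 1.0), (q - 1.0, q + 1.0)
    if q == math.inf:
        return (p - 1.0, p + 1.0), (p + 1.0, math.inf)
    delta = (q - p) / 2.0
    return (p - delta, p + delta), (q - delta, q + delta)

def contains(neighborhood, x):
    """True when x lies in the neighborhood (infinite endpoints included)."""
    lo, hi = neighborhood
    return lo < x < hi or (x == hi == math.inf) or (x == lo == -math.inf)

print(disjoint_neighborhoods(3, 5))
print(disjoint_neighborhoods(2, math.inf))
print(disjoint_neighborhoods(math.inf, -math.inf))
```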
Exercise Give examples of disjoint neighborhoods of

(i) 2 and $-4$

(ii) $-\infty$ and 5

(iii) $\infty$ and $-\infty$

(iv) 1 and $-1$


4.3 Types of points for a set 


Consider a set 


$S \subset \mathbb{R}^*$.


A point $p \in \mathbb{R}^*$ is said to be an interior point of $S$ if it has a neighborhood
$U$ lying entirely inside $S$:

$U \subset S$.

For example, for the set

$E = (-4, 5] \cup \{6, 8\} \cup [9, \infty]$,

the points $-2, 3, 11$ are interior points. The point $\infty$ is also an interior point
of $E$.

A point $p$ is an exterior point if it has a neighborhood $U$ lying entirely
outside $S$:

$U \subset S^c$.

For example, for the set $E$ above, points $-5$, 7, and $-\infty$ are exterior to $E$.

A point that is neither interior to $S$ nor exterior to $S$ is a boundary point
of $S$. Thus $p$ is a boundary point of $S$ if every neighborhood of $p$ intersects
both $S$ and $S^c$.

In the example above, the boundary points of $E$ are

$-4, 5, 6, 8, 9$.

Next consider the set

$\{3\} \cup (5, \infty)$.

The boundary points are 3, 5, and $\infty$. It is important to observe that if we
work with the real line $\mathbb{R}$ instead of the extended line $\mathbb{R}^*$ then we must exclude
$\infty$ as a boundary point, because it doesn’t exist as far as $\mathbb{R}$ is concerned.
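
For sets built out of finitely many intervals and isolated points, the classification of a point can be automated. The sketch below makes a simplifying assumption that must be kept in mind: the pieces of the set are separated from one another (no two pieces touch or overlap). Each piece is recorded as `(lo, hi, lo_closed, hi_closed)`, a single point `c` being the piece `(c, c, True, True)`; this representation and the function name `classify` are ours.

```python
import math

INF = math.inf

def classify(p, pieces):
    """Classify p as 'interior', 'exterior' or 'boundary' for the union of
    the given pieces, assumed to be pairwise separated."""
    for lo, hi, lo_closed, hi_closed in pieces:
        if lo < p < hi:
            return "interior"
        # An infinite endpoint is interior when the piece actually contains it.
        if p == INF and hi == INF and hi_closed and lo < INF:
            return "interior"
        if p == -INF and lo == -INF and lo_closed and hi > -INF:
            return "interior"
    # Exterior: p stays clear of every piece, endpoints included.
    if all(p < lo or p > hi for lo, hi, _, _ in pieces):
        return "exterior"
    return "boundary"

# The set E = (-4, 5] together with the points 6 and 8 and the ray [9, infinity].
E = [(-4, 5, False, True), (6, 6, True, True), (8, 8, True, True), (9, INF, True, True)]

for point in [-2, 11, INF, -5, 7, -INF, -4, 5, 6, 9]:
    print(point, classify(point, E))
```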


Example For the set $A = [-\infty, 4) \cup \{5, 9\} \cup [6, 7)$, decide which of the following
are true and which false:

(i) $-6$ is an interior point (T)

(ii) 6 is an interior point (F)

(iii) 9 is a boundary point (T)

(iv) 5 is an interior point (F)


Exercise For the set $B = [-\infty, -5) \cup \{2, 5, 8\} \cup [4, 7)$, decide which of the following
are true and which false:

(i) $-6$ is an interior point
(ii) $-5$ is an interior point
(iii) 5 is a boundary point
(iv) 4 is an interior point
(v) 7 is a boundary point.


4.4 Interior, Exterior, and Boundary of a Set

The set of all interior points of a set $S$ is denoted

$S^{\circ}$


and is called the interior of S. 
The set of all boundary points of $S$ is denoted

$\partial S$

and is called the boundary of $S$.
The set of all points exterior to $S$ is the exterior of $S$, and we shall denote
it

$S^{\mathrm{ext}}$.

Thus, the whole extended line $\mathbb{R}^*$ is split up into three disjoint pieces:

$\mathbb{R}^* = S^{\circ} \cup \partial S \cup S^{\mathrm{ext}}$. (4.3)


Recall that a point $p$ is on the boundary of $S$ if every neighborhood of
the point intersects both $S$ and $S^c$. But this is exactly the condition for $p$ to
be on the boundary of $S^c$. Thus

$\partial S = \partial S^c$. (4.4)


The interior of the entire extended line $\mathbb{R}^*$ is all of $\mathbb{R}^*$. So

$\partial \mathbb{R}^* = \emptyset$.


Moreover,

$\partial \emptyset = \emptyset$.


At another extreme of unexpected behavior is the set $\mathbb{Q}$ of rational numbers.
If you take any neighborhood $U$ of any point in $\mathbb{R}^*$ then $U$ contains
both rational numbers and irrational numbers. Thus, every point in $\mathbb{R}^*$ is a
boundary point of $\mathbb{Q}$:

$\partial \mathbb{Q} = \mathbb{R}^*$.


Example For the set $A = [-\infty, 4) \cup \{5, 9\} \cup [6, 7)$,

(i) $A^{\circ} = [-\infty, 4) \cup (6, 7)$
(ii) $\partial A = \{4, 5, 9, 7, 6\}$
(iii) $A^c = [4, 5) \cup (5, 6) \cup [7, 9) \cup (9, \infty]$
(iv) the interior of the complement $A^c$ is
\[ (A^c)^{\circ} = (4, 5) \cup (5, 6) \cup (7, 9) \cup (9, \infty]. \]

For the set
\[ G = (3, \infty) \]
the boundary of G, when viewed as a subset of $\mathbb{R}^*$, is
\[ \partial G = \{3, \infty\}. \]
But if we decide to work only inside $\mathbb{R}$ then the boundary of G is just $\{3\}$.
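For sets built from finitely many intervals and isolated points, the split into interior, boundary, and exterior points can be checked mechanically. The following is a minimal Python sketch and is not part of the notes: the helper `classify` and the representation of a set as a list of `(lo, hi)` pairs are assumptions made for illustration. It also assumes that the pieces are separated from one another (no two pieces touch) and that the point p being tested is a real number.

```python
import math

def classify(p, pieces):
    """Classify a real number p relative to a union of separated pieces.

    Each piece is a pair (lo, hi): an interval with endpoints lo < hi
    (open or closed at either end), or an isolated point when lo == hi.
    """
    # Strictly inside some interval: a small neighborhood of p stays in the set.
    if any(lo < p < hi for lo, hi in pieces):
        return "interior"
    # An endpoint or an isolated point: every neighborhood of p meets
    # both the set and its complement.
    if any(lo <= p <= hi for lo, hi in pieces):
        return "boundary"
    return "exterior"

# A = [-inf, 4) U {5, 9} U [6, 7), the set used in the example.
A = [(-math.inf, 4), (5, 5), (9, 9), (6, 7)]
for p in (-6, 4, 5, 6, 6.5, 7, 8, 9):
    print(p, classify(p, A))
```

Whether an endpoint belongs to the set plays no role here: an endpoint of an interval is a boundary point either way, which is why the pairs do not record open or closed ends.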


Exercise For the set $B = \{-4, 8\} \cup [1, 7) \cup [9, \infty)$,

(i) $B^{\circ} =$
(ii) $\partial B =$
(iii) $B^c =$
(iv) the interior of the complement $B^c$ is
\[ (B^c)^{\circ} = \]


4.5 Open Sets and Topology 


We say that a set is open if it does not contain any of its boundary points.
For example,
\[ (2, 3) \cup (5, 9) \]
is open. The set
\[ (3, 4] \]
is not open, because it contains 4, which is a boundary point. On the other
hand
\[ (4, \infty] \]
is open (even though it is not what is usually called ‘an open interval’).
The entire extended line $\mathbb{R}^*$ is open, because it has no boundary points.
The empty set $\emptyset$ is open, because, again, it doesn’t have any boundary
points.

Notice then that every point of an open set is an interior point. Thus, saying that a
set S is open means that
\[ S^{\circ} = S. \]

Thus for an open set S each point has a neighborhood contained entirely 
inside S. In other words, S is made up of a union of neighborhoods. 

Viewed in this way, it becomes clear that the union of open sets is an 
open set. 

It can also be verified that: 

The intersection of a finite number of open sets is open. 


Exercise Check that the intersection of the sets $(4, \infty)$, $(-3, 5)$, and $(2, 6)$ is
open.
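The intersection of finitely many open intervals can be computed by taking the largest left endpoint and the smallest right endpoint. Here is a small Python sketch of that computation; the function name `intersect_open` is hypothetical, and the three intervals are read from the exercise as $(4, \infty)$, $(-3, 5)$, and $(2, 6)$.

```python
import math

def intersect_open(*intervals):
    """Intersect finitely many open intervals (a, b), given as pairs.

    Returns the open interval (lo, hi), or None when the intersection
    is empty (the empty set is open as well).
    """
    lo = max(a for a, _ in intervals)
    hi = min(b for _, b in intervals)
    return (lo, hi) if lo < hi else None

result = intersect_open((4, math.inf), (-3, 5), (2, 6))
print(result)  # (4, 5)
```

The answer is again an open interval, in line with the statement that a finite intersection of open sets is open.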


The collection of all open subsets of R is called the topology of R. 
The set of all open subsets of R* is called the topology of R*. 


4.6 Closed Sets 


A set S is said to be closed if it contains all its boundary points.
In other words, S is closed if
\[ \partial S \subset S. \]
Thus,
\[ [4, 8] \cup [9, \infty] \]
is closed.
But
\[ [4, 5) \]
is not closed because the boundary point 5 is not in this set.
The set
\[ [3, \infty) \]
is not closed (as a subset of $\mathbb{R}^*$) because the boundary point $\infty$ is not inside
the set. But, viewed as a subset of $\mathbb{R}$, it is closed. So we need to be careful in
deciding what is closed and what isn’t: a set may be closed viewed as a subset
of $\mathbb{R}$ but not as a subset of $\mathbb{R}^*$.
The full extended line $\mathbb{R}^*$ is closed.
The empty set $\emptyset$ is also closed.
Note that the sets $\mathbb{R}^*$ and $\emptyset$ are both open and closed.


4.7 Open Sets and Closed Sets 


Consider a set $S \subset \mathbb{R}^*$.
If S is open then its boundary points are all outside S:
\[ \partial S \subset S^c. \]
But recall that the boundary of S is the same as the boundary of the complement
$S^c$. Thus, for S to be open we must have
\[ \partial(S^c) \subset S^c, \]
which means that $S^c$ contains all its boundary points. But this means that
$S^c$ is closed.

Thus, if a set is open then its complement is closed.

The converse is also true: if a set is closed then its complement is open. 
Thus, 


Theorem 4.7.1 A subset of R* is open if and only if its complement is 
closed. 


Exercise. Consider the open set (1, 5). Check that its complement is
closed.

Exercise. Consider the closed set $[4, \infty]$. Show that its complement is
open.


4.8 Closed sets in R and in R* 


The set
\[ [3, \infty) \]
is closed in $\mathbb{R}$, but is not closed in $\mathbb{R}^*$. This is because in $\mathbb{R}$ it has only the
boundary point 3, which it contains; in contrast, in $\mathbb{R}^*$ the point $\infty$ is also a
boundary point and is not in the set. Thus, when working with closed sets it
is important to bear in mind the distinction between being closed in $\mathbb{R}$ and
being closed in $\mathbb{R}^*$. There is no such distinction for open sets.


Chapter 5 


Magnitude and Distance 


5.1 Absolute Value 


The absolute value or magnitude of $x \in \mathbb{R}^*$ is the measure of how large x is,
without regard to its sign; the absolute value of x is defined by
\[ |x| = \begin{cases} x & \text{if } x \ge 0; \\ -x & \text{if } x < 0. \end{cases} \tag{5.1} \]
In the second line $-x$ helps flip the sign of a negative value of x to a positive
one:
\[ |-3| = -(-3) = 3. \]
Note that
\[ |0| = 0 \]
and
\[ |x| \ge 0 \quad \text{for all } x \in \mathbb{R}^*. \]
Another observation that comes in handy occasionally is:
\[ -|x| \le x \le |x| \quad \text{for all } x \in \mathbb{R}^*, \tag{5.2} \]
and in fact, of course, x is equal to either $|x|$ (if $x \ge 0$) or $-|x|$ (if $x \le 0$).
This gives another useful specification of $|x|$:

$|x|$ is the larger of the numbers $x$ and $-x$.

As a formula this is:
\[ |x| = \max\{x, -x\} \quad \text{for all } x \in \mathbb{R}^*. \tag{5.3} \]
In other words, $|x|$ is $x$ or $-x$, whichever is $\ge 0$.
It is clear that
\[ |-x| = |x| \quad \text{for all } x \in \mathbb{R}^*. \tag{5.4} \]


5.2 Inequalities and equalities 


If we take two non-negative numbers, say 3 and 5, then we have
\[ |3 + 5| = 8 = |3| + |5|. \]
The same works if both numbers are negative:
\[ |(-3) + (-5)| = 8 = |-3| + |-5|. \]
But if one is positive and the other negative then the sum of the absolute
values wins out over the absolute value of the sum:
\[ |5 + (-3)| = 2 < |5| + |-3|. \]
We can summarize this in the triangle inequality for magnitudes:
\[ |a + b| \le |a| + |b|, \tag{5.5} \]
for all $a, b \in \mathbb{R}^*$ excluding, as always, the cases $\infty + (-\infty)$ and $(-\infty) + \infty$.
Here is a proof of this: since $|a + b|$ is the larger of $a + b$ and $-(a + b)$, we
just need to show that both of these are $\le$ the sum $|a| + |b|$. For this observe
first that
\[ a + b \le |a| + |b| \quad \text{because } a \le |a| \text{ and } b \le |b|, \]
and then observe that
\[ -(a + b) = (-a) + (-b) \le |a| + |b| \quad \text{because } -a \le |a| \text{ and } -b \le |b|. \]
Thus both $a + b$ and $-(a + b)$ are less than or equal to $|a| + |b|$, and so the larger
of $a + b$ and $-(a + b)$ is $\le |a| + |b|$. This proves (5.5).
For multiplication we have equality of absolute values:
\[ |ab| = |a||b| \tag{5.6} \]
for all $a, b \in \mathbb{R}^*$. You can check this by considering all possible choices of
signs for a and b.
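The triangle inequality and the product rule for magnitudes can be spot-checked on a handful of real numbers. The Python sketch below is only a sanity check on a finite sample, not a proof; the sample values and the helper name `magnitude` are chosen here for illustration.

```python
import itertools

# Sample values with exact binary representations, so the product
# comparison below involves no rounding.
samples = [-5.0, -3.0, -0.5, 0.0, 0.5, 3.0, 5.0]

def magnitude(x):
    # |x| as the larger of x and -x
    return max(x, -x)

pairs = list(itertools.product(samples, repeat=2))
triangle_ok = all(magnitude(a + b) <= magnitude(a) + magnitude(b) for a, b in pairs)
product_ok = all(magnitude(a * b) == magnitude(a) * magnitude(b) for a, b in pairs)
print(triangle_ok, product_ok)
```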
5.3 Distance

The distance between two real numbers a and b is the magnitude of $a - b$:
\[ d(a, b) \overset{\text{def}}{=} |a - b|. \tag{5.7} \]
Here are two basic properties of distance:
\[ d(a, a) = 0 \quad \text{for all } a \in \mathbb{R}, \tag{5.8} \]
and
\[ \text{if } d(a, b) = 0 \text{ then } a = b. \tag{5.9} \]
There is a third, less obvious, property, called the triangle inequality,
that is of great use:
\[ d(a, c) \le d(a, b) + d(b, c) \quad \text{for all } a, b, c \in \mathbb{R}. \tag{5.10} \]
This follows from the triangle inequality for magnitudes:
\[ d(a, c) = |a - c| = |a - b + b - c| \le |a - b| + |b - c| = d(a, b) + d(b, c). \]


The specific measure of distance given by (5.7) is completely natural and 
intuitive but does not extend nicely to R*. Other measures of distance can 
be constructed that work on R*. 
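One way to get a distance that does work on the extended line is to compare arctangents of the points instead of the points themselves. This particular choice is an illustration and is not taken from these notes; the name `d_ext` is hypothetical. A minimal Python sketch, relying on `math.atan` returning $\pi/2$ at $\infty$ and $-\pi/2$ at $-\infty$:

```python
import math

def d_ext(a, b):
    # An illustrative distance on the extended line: the gap between
    # arctan(a) and arctan(b), with arctan(inf) = pi/2, arctan(-inf) = -pi/2.
    return abs(math.atan(a) - math.atan(b))

print(d_ext(0.0, math.inf))         # close to pi/2
print(d_ext(-math.inf, math.inf))   # close to pi
print(d_ext(math.inf, math.inf))    # 0.0
```

By contrast, the formula $|a - b|$ applied to $a = b = \infty$ would ask for $\infty - \infty$, which is not defined.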


5.4 Neighborhoods and distance 


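Consider a point $p \in \mathbb{R}$ and a neighborhood of p given by
\[ (p - \delta, p + \delta) = \{x \in \mathbb{R} : p - \delta < x < p + \delta\}, \]
where $\delta$ is a positive real number. Clearly this neighborhood consists exactly
of those points x whose distance from p is less than $\delta$. Thus we have
\[ (p - \delta, p + \delta) = \{x \in \mathbb{R} : |x - p| < \delta\} = \{x \in \mathbb{R} : d(x, p) < \delta\}. \tag{5.11} \]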
Chapter 6
Limits


The concept of limit is fundamental to calculus. It is very easy to grasp
intuitively but quite difficult to pin down in a completely precise mathematical
way. For example, anyone would agree that $x^2$ approaches 9 when x
approaches 3; but explaining exactly what this means is a subtle matter. In
a first run through the theory it may in fact be practical to give up on this
precise specification and just rely on intuition. But using our technology of
sup and inf makes it somewhat easy to come to grips with the exact meaning
of $x^2 \to 9$ as $x \to 3$.

For the discussions in this chapter and also elsewhere there is some notational
care that is needed in working with values f(x) of a function f.
Clearly for such quantities, the point x itself must be in the domain of f.
Consider a function f with domain $S \subset \mathbb{R}$, and let p be a point in S. If U is
a neighborhood of p then part of U might not be inside S, and so when we
speak of the values of f on U we need to focus on f(x) for $x \in U \cap S$. For
instance, the function f given by
\[ f(x) = \sqrt{x} \quad \text{for } x \in [0, \infty) \]
has domain $S = [0, \infty)$, and if we take p to be the point 0 then a typical
neighborhood of p, such as $(-.01, .01)$, falls partly outside S. Putting the
restriction $x \in U \cap S$ makes it possible to talk about the value f(x).
6.1 Limits, Sup and Inf

Consider the function g defined on all real numbers through the formula
\[ g(x) = \begin{cases} x^3 & \text{if } x \ne 2; \\ 0 & \text{if } x = 2. \end{cases} \]
Intuitively it is clear that as x approaches 2 the value g(x) approaches $2^3 = 8$;
note that the actual value g(2), which is given to be 0, is irrelevant to this.
We write this symbolically as
\[ g(x) \to 8 \quad \text{as } x \to 2 \]
or, even more compactly, as
\[ \lim_{x \to 2} g(x) = 8. \]
We read this as ‘g(x) has the limit 8 as x approaches 2’.

Our goal is to pin down the exact meaning of this. For this consider the
behavior of the values g(x) when x is restricted to some neighborhood, say
(1.5, 2.5), of 2, again ignoring the actual value g(2):
\[ \{g(x) : x \in (1.5, 2.5) \text{ and } x \ne 2\}. \]
How high does g(x) get here? Clearly it is
\[ \sup\{g(x) : x \in (1.5, 2.5) \text{ and } x \ne 2\} = (2.5)^3 = 15.625 \]
and, on the lower side we have
\[ \inf\{g(x) : x \in (1.5, 2.5) \text{ and } x \ne 2\} = (1.5)^3 = 3.375. \]
There is a simpler notation for these sups and infs:
\[ \sup_{x \in (1.5, 2.5),\, x \ne 2} g(x) = 15.625, \qquad \inf_{x \in (1.5, 2.5),\, x \ne 2} g(x) = 3.375. \]
We can improve our understanding of the behavior of g(x) for x approaching
2 by focusing on a smaller neighborhood of 2, say (1.9, 2.1). For this we have
\[ \sup_{x \in (1.9, 2.1),\, x \ne 2} g(x) = 9.261, \qquad \inf_{x \in (1.9, 2.1),\, x \ne 2} g(x) = 6.859. \]
As we have shrunk the neighborhood the supremum has decreased and the
infimum has increased. But notice that the number 8 (our suspect for the
limit) lies between the sup and the inf in both cases. Indeed intuition, and in
this case easy verification, suggests that:

the limit $\lim_{x \to 2} g(x)$ always lies between the sup and inf of the
values of g(x) on neighborhoods of 2 (always excluding the value
x = 2).

In fact,
\[ \lim_{x \to 2} g(x) \]
is the unique value that lies between the sup and inf of the values of g(x) on
all neighborhoods of 2 (always excluding the value x = 2).
This provides us with an official definition of limit:

Definition 6.1.1 Let g be a function defined on a set $S \subset \mathbb{R}$ and let p be
any point in $\mathbb{R}^*$. We say that g(x) approaches the limit $L \in \mathbb{R}^*$ as $x \to p$,
writing this as
\[ \lim_{x \to p} g(x) = L, \]
if L is the unique value that lies between the sups and infs of the values of g
in neighborhoods of p (excluding p itself):
\[ \inf_{x \in U \cap S,\, x \ne p} g(x) \le L \le \sup_{x \in U \cap S,\, x \ne p} g(x) \quad \text{for every neighborhood } U \text{ of } p. \tag{6.1} \]

The reason for using $x \in U \cap S$ is that g(x) is only defined for x in the
set S. One other point is that if in fact $U \cap S$ contains no point other than
p then the inf and sup in (6.1) are over the empty set and so (6.1) can never
hold (for the left side is $\infty$ and the right side is $-\infty$) for any value of L, and
so the limit does not exist in this case. Thus there is no possibility of the
limit existing if p is not a limit point of S. It is best not to worry about these
fine points too much at this stage.

A note of caution: the above definition is not the standard one but is 
equivalent to it. 
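To get a feel for the definition, one can watch the inf and sup of the function g (equal to $x^3$ away from 2, and to 0 at 2) close in on 8 as the neighborhood of 2 shrinks. The Python sketch below approximates the inf and sup by sampling a grid of points; the grid size and the helper name `sampled_inf_sup` are choices made here, and a sampled minimum or maximum only approximates the true inf or sup from inside.

```python
def g(x):
    # The function from the text: x**3 away from 2, and 0 at x = 2.
    return 0.0 if x == 2 else x ** 3

def sampled_inf_sup(delta, n=10_000):
    # Sample the punctured neighborhood (2 - delta, 2 + delta) without 2.
    xs = [2 - delta + 2 * delta * k / n for k in range(1, n)]
    values = [g(x) for x in xs if x != 2]
    return min(values), max(values)

for delta in (0.5, 0.1, 0.01, 0.001):
    lo, hi = sampled_inf_sup(delta)
    print(f"delta={delta}: inf ~ {lo:.6f}, sup ~ {hi:.6f}")
```

In every row the value 8 sits between the two printed numbers, and the gap between them narrows as delta decreases.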

Official definitions of the notion of limit are useful in proving theorems but 
nearly useless in actually computing limits except for very simple functions. 
We look at two such simple examples now just to make sure the definition 
produces values in agreement with common sense. 
Take, for a starter example, the constant function
\[ K(x) = 5 \quad \text{for all } x \in \mathbb{R}. \]
We want to make sure that the official Definition 6.1.1 does imply that
\[ \lim_{x \to 3} K(x) = 5. \]
To check this consider any neighborhood of 3:
\[ (3 - \delta, 3 + \delta), \]
where $\delta$ is any positive real number. Then
\[ \sup_{x \in (3 - \delta, 3 + \delta),\, x \ne 3} K(x) = 5 \]
because the set of values K(x) is just $\{5\}$, and also
\[ \inf_{x \in (3 - \delta, 3 + \delta),\, x \ne 3} K(x) = 5. \]
Thus the only value that lies between the sup and the inf is 5 itself, and
hence
\[ \lim_{x \to 3} K(x) = 5. \]


Now let us move to the function
\[ f(x) = x \quad \text{for all } x \in \mathbb{R}. \]
We would like to make sure that Definition 6.1.1 does imply that $\lim_{x \to 6} f(x)$
is 6. Consider the neighborhood
\[ (6 - \delta, 6 + \delta), \]
where $\delta$ is a positive real number. Then
\[ \{f(x) : x \in (6 - \delta, 6 + \delta),\ x \ne 6\} = \{x : x \in (6 - \delta, 6 + \delta),\ x \ne 6\}, \]
which is just the interval $(6 - \delta, 6 + \delta)$, but with the point 6 excluded. Hence
its sup is $6 + \delta$ and its inf is $6 - \delta$. What value lies between these two no
matter what $\delta$ is? Certainly it is 6:
\[ \inf_{x \in (6 - \delta, 6 + \delta),\, x \ne 6} f(x) \le 6 \le \sup_{x \in (6 - \delta, 6 + \delta),\, x \ne 6} f(x). \]
Hence,
\[ \lim_{x \to 6} f(x) = 6. \]
In fact, it is clear that
\[ \lim_{x \to p} x = p, \]
for every $p \in \mathbb{R}^*$. (The case $p = \infty$ or $p = -\infty$ requires a special, but
not difficult, argument, because neighborhoods of these points look different
from the usual $(p - \delta, p + \delta)$ form.)

Before moving on to fancier explorations here is a warning on notation.
There is nothing special about x, which we have been using in writing limits.
Instead of $\lim_{x \to p} f(x)$ we could just as well write $\lim_{y \to p} f(y)$ or $\lim_{w \to p} f(w)$:
\[ \lim_{x \to p} f(x) = \lim_{y \to p} f(y) = \lim_{w \to p} f(w) = \lim_{\text{blah} \to p} f(\text{blah}). \]
The only rule about notation is that it must be consistent: never use the
same letter to mean two different things in the same equation or statement!


6.2 Limits for 1/x

Common sense shows that
\[ 1/x \to 0, \quad \text{as } x \to \infty. \]
We will first show that
\[ \inf_{x \in (t, \infty)} \frac{1}{x} = 0, \]
for any positive real number t.
Since 1/x > 0 when x is positive, 0 is a lower bound for such 1/x and so
the greatest lower bound is $\ge 0$:
\[ \inf_{x \in (t, \infty)} \frac{1}{x} \ge 0. \]
We need only show that this inf cannot be > 0. Denote the inf by b:
\[ b = \inf_{x \in (t, \infty)} \frac{1}{x}. \]
Suppose b > 0. Now
\[ \frac{1}{x} < b \]
whenever x > 1/b. So if we take some real number y larger than both 1/b
and t, say y = t + 1/b, then this y is in $(t, \infty)$ and is also > 1/b and so
\[ \frac{1}{y} < b. \]
Consequently,
\[ \inf_{x \in (t, \infty)} \frac{1}{x} < b. \]
This is impossible, since it is saying b < b. Hence
\[ \inf_{x \in (t, \infty)} \frac{1}{x} = 0. \]
This holds for all real t > 0. Hence the value 0 satisfies
\[ \inf_{x \in (t, \infty],\, x \ne \infty} \frac{1}{x} \le 0 \le \sup_{x \in (t, \infty],\, x \ne \infty} \frac{1}{x} \]
for all positive real t. (It is a tiresome check to see that it holds also for
$t \le 0$, keeping in mind that x = 0 is excluded from the domain of 1/x.) Hence
\[ \lim_{x \to \infty} \frac{1}{x} = 0. \tag{6.2} \]
A similar argument shows that
\[ \lim_{x \to -\infty} \frac{1}{x} = 0. \tag{6.3} \]
There are two more limits associated with 1/x. Taking 1/x only on the
domain of positive values of x we have
\[ \lim_{x \to 0,\, x > 0} \frac{1}{x} = \infty, \tag{6.4} \]
and focusing on negative values we have
\[ \lim_{x \to 0,\, x < 0} \frac{1}{x} = -\infty. \tag{6.5} \]
These are usually written as
\[ \lim_{x \to 0+} \frac{1}{x} = \infty, \qquad \lim_{x \to 0-} \frac{1}{x} = -\infty. \tag{6.6} \]
The limit of 1/x as $x \to 0$ does not exist. You can check that in any
neighborhood of 0, excluding the value x = 0 itself, the sup of 1/x is $\infty$
whereas the inf of 1/x is $-\infty$, and so there is no unique value between these
two extremes.
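The key step in the argument for the inf of 1/x over $(t, \infty)$ was the choice y = t + 1/b: this y lies beyond t and has 1/y below b. The Python sketch below simply replays that choice for a few positive values of t and b; the function name `witness` and the sample values are chosen here for illustration.

```python
def witness(t, b):
    # Following the argument in the text: y = t + 1/b lies in (t, infinity)
    # and satisfies 1/y < b, so no b > 0 can be a lower bound for 1/x there.
    y = t + 1.0 / b
    return y, 1.0 / y

for t, b in [(1.0, 0.5), (10.0, 1e-3), (1e6, 1e-9)]:
    y, value = witness(t, b)
    print(t, b, y > t and value < b)
```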

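6.3 A function with no limits

Recall the set $\mathbb{Q}$ of all rational numbers:
\[ \mathbb{Q} = \{\text{all rationals}\}. \]
This is dense in $\mathbb{R}$:

every open interval in $\mathbb{R}$ contains rational points.

The same is true of the irrationals:

every open interval in $\mathbb{R}$ contains irrational points.

Consider the indicator function of $\mathbb{Q}$, taking the value 1 on rationals and
0 on irrationals:
\[ 1_{\mathbb{Q}}(x) = \begin{cases} 1 & \text{if } x \in \mathbb{Q}; \\ 0 & \text{if } x \notin \mathbb{Q}. \end{cases} \tag{6.7} \]
If you take any $p \in \mathbb{R}$ and any neighborhood U of p it is clear that
\[ \sup_{x \in U,\, x \ne p} 1_{\mathbb{Q}}(x) = 1 \quad \text{and} \quad \inf_{x \in U,\, x \ne p} 1_{\mathbb{Q}}(x) = 0. \]
Thus there can be no unique value between these sups and infs, and so

$\lim_{x \to p} 1_{\mathbb{Q}}(x)$ does not exist for any $p \in \mathbb{R}$.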
6.4 Limits of sequences

Recall that a sequence s is a function whose domain is the set $\mathbb{N} = \{1, 2, 3, \ldots\}$
(and occasionally, we permit 0 in the domain). The value of the function s
on the number $n \in \mathbb{N}$ is denoted by
\[ s_n \]
rather than s(n), and the function itself is generally written as
\[ (s_n)_{n \ge 1} \]
rather than s.

Our definition of the limit $\lim_{x \to p} f(x)$ makes sense for any function f
with domain some subset S of $\mathbb{R}$ and with p being a limit point of S. Thus
we can apply it to the case of sequences, with S being $\mathbb{N}$ and $p = \infty$. For a
sequence $(s_n)_{n \ge 1}$ we are interested in the limit
\[ \lim_{n \to \infty} s_n. \]
For example,
\[ \lim_{n \to \infty} \frac{1}{n} = 0. \]
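It is easy to believe that
\[ \lim_{n \to \infty} 2^{-n} = 0. \tag{6.8} \]
One way to prove this officially is by using the inequality
\[ 2^n > n \quad \text{for all } n \in \{1, 2, 3, \ldots\}. \tag{6.9} \]
($2^n$ is the total number of subsets that an n-element set, such as $\{1, 2, \ldots, n\}$,
has, and clearly this is more than n itself since each of the n elements itself
provides a subset.)
As a consequence of (6.9) we have
\[ 0 < 2^{-n} < \frac{1}{n} \quad \text{for all } n \in \{1, 2, 3, \ldots\}, \]
and since $1/n \to 0$ as $n \to \infty$, this gives (6.8).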


Sometimes we work with infinite series sums: for a sequence $(s_n)_{n \ge 1}$ the
series sum
\[ \sum_{n=1}^{\infty} s_n \]
is defined to be
\[ \sum_{n=1}^{\infty} s_n = s_1 + s_2 + \cdots = \lim_{N \to \infty} S_N, \tag{6.10} \]
where $S_N$ is the N-th partial sum
\[ S_N = s_1 + \cdots + s_N. \]
When working with series, very often the index 0 is allowed and one works
with sums
\[ \sum_{n=0}^{\infty} s_n. \]
We will not explore these ideas much further at this point. But before moving
on let us work out one example. We will work out the value of the infinite
series sum
\[ 1 + \frac{1}{2} + \frac{1}{2^2} + \frac{1}{2^3} + \cdots \]
(If you think about it, this surely should add up to 2 ... we will verify this
intuition using mathematics.)

In the summation notation this is displayed as
\[ \sum_{n=0}^{\infty} \frac{1}{2^n}. \]
Let us first work out the partial sum
\[ S_N = 1 + \frac{1}{2} + \cdots + \frac{1}{2^N}. \]
The clever trick at this stage is to multiply this by 1/2:
\[ \frac{1}{2} S_N = \frac{1}{2} + \frac{1}{2^2} + \cdots + \frac{1}{2^N} + \frac{1}{2^{N+1}}. \]
Observe that this is very similar to $S_N$ itself: when we subtract, nearly everything
cancels out:
\[ S_N - \frac{1}{2} S_N = 1 - \frac{1}{2^{N+1}}. \]
Thus,
\[ \frac{1}{2} S_N = 1 - \frac{1}{2^{N+1}}. \]
Now we have the expression for $S_N$:
\[ S_N = 1 + \frac{1}{2} + \cdots + \frac{1}{2^N} = 2 - \frac{1}{2^N}. \tag{6.11} \]
Letting $N \to \infty$ produces the infinite series sum
\[ 1 + \frac{1}{2} + \frac{1}{2^2} + \cdots = 2. \tag{6.12} \]
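The partial sums of this series can be computed exactly with rational arithmetic, which makes the cancellation visible in numbers. The following Python sketch uses the standard `fractions` module; the function name `partial_sum` is hypothetical. It compares each partial sum $S_N = 1 + 1/2 + \cdots + 1/2^N$ with the closed form $2 - 1/2^N$ that the subtraction trick produces.

```python
from fractions import Fraction

def partial_sum(N):
    # S_N = 1 + 1/2 + ... + 1/2**N, computed exactly as a fraction.
    return sum(Fraction(1, 2 ** n) for n in range(N + 1))

for N in (0, 1, 2, 5, 10):
    S = partial_sum(N)
    # Closed form obtained from S_N - (1/2) S_N = 1 - 1/2**(N+1).
    assert S == 2 - Fraction(1, 2 ** N)
    print(N, S, float(S))
```

The printed values creep up toward 2 without reaching it, since the shortfall $1/2^N$ is positive for every N.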


6.5 Lim with sups and infs

In this section we are going to push our grasp on sup’s and inf’s to the limit!

Consider a function f on a set $S \subset \mathbb{R}$ and a point $p \in \mathbb{R}^*$. To avoid
unimportant technicalities let us assume that p is a limit point of S. We
know then that, for any neighborhood U of p,
\[ \inf_{x \in U \cap S,\, x \ne p} f(x) \le \sup_{x \in U \cap S,\, x \ne p} f(x). \]
Now consider another neighborhood V of p. Both U and V contain the
neighborhood
\[ W = U \cap V. \]
Hence
\[ \inf_{x \in U \cap S,\, x \ne p} f(x) \le \inf_{x \in W \cap S,\, x \ne p} f(x) \quad \text{and} \quad \sup_{x \in W \cap S,\, x \ne p} f(x) \le \sup_{x \in V \cap S,\, x \ne p} f(x), \tag{6.13} \]
because shrinking a set raises the inf and lowers the sup. Since the inf over W
is $\le$ the sup over W, as a consequence we have
\[ \inf_{x \in U \cap S,\, x \ne p} f(x) \le \sup_{x \in V \cap S,\, x \ne p} f(x). \tag{6.14} \]
Thus the sup over any neighborhood of p is $\ge$ the inf over every neighborhood
of p.

Thus, there always exists a point that lies between the sups and infs of the
values f(x) for x in neighborhoods of p, excluding x = p. If there is a unique
such point then that point is $\lim_{x \to p} f(x)$.

We can reformulate the relation (6.14) as follows: the sup of f over any
neighborhood of p is an upper bound for the set of all the inf values of f over
all neighborhoods of p. Hence: the sup of f over any neighborhood V of p is
$\ge$ the sup of the values $\inf_{x \in U \cap S,\, x \ne p} f(x)$ for all neighborhoods U of p:
\[ \sup_U \inf_{x \in U \cap S,\, x \ne p} f(x) \le \sup_{x \in V \cap S,\, x \ne p} f(x), \tag{6.15} \]
where $\sup_U$ is the sup over all neighborhoods U of p. Since this holds for all
neighborhoods V of p we then conclude that
\[ \sup_U \inf_{x \in U \cap S,\, x \ne p} f(x) \le \inf_V \sup_{x \in V \cap S,\, x \ne p} f(x), \tag{6.16} \]
where $\inf_V$ is the inf over all neighborhoods V of p.

Pondering (6.16) we observe that $L = \lim_{x \to p} f(x)$ exists if and only if
the two extremes $\sup_U \inf_{x \in U \cap S,\, x \ne p} f(x)$ and $\inf_V \sup_{x \in V \cap S,\, x \ne p} f(x)$ are equal,
and this common value must be L itself.

To summarize:

Proposition 6.5.1 Let f be a function defined on a set $S \subset \mathbb{R}$, and p a limit
point of S. Then $\lim_{x \to p} f(x)$ exists if and only if $\sup_U \inf_{x \in U \cap S,\, x \ne p} f(x)$ and
$\inf_V \sup_{x \in V \cap S,\, x \ne p} f(x)$ are equal, and then
\[ \sup_U \inf_{x \in U \cap S,\, x \ne p} f(x) = \lim_{x \to p} f(x) = \inf_V \sup_{x \in V \cap S,\, x \ne p} f(x). \tag{6.17} \]
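A rough numerical picture of Proposition 6.5.1 can be had by watching the inf and the sup over shrinking neighborhoods of p = 0 for two functions: $x^2$, whose limit at 0 exists, and $\sin(1/x)$, which is brought in here purely as an illustration and does not appear in these notes. The Python sketch below samples each punctured neighborhood on a grid, so the numbers are approximations from inside; the helper name `inf_sup_near_zero` and the grid size are assumptions.

```python
import math

def inf_sup_near_zero(f, delta, n=10_000):
    # Sampled inf and sup of f over the punctured neighborhood
    # (-delta, delta) without 0; only an approximation from inside.
    values = [f(s * delta * k / n) for k in range(1, n) for s in (-1.0, 1.0)]
    return min(values), max(values)

def square(x):
    return x * x

def wiggle(x):
    return math.sin(1.0 / x)

for delta in (0.1, 0.01, 0.001):
    print(delta, inf_sup_near_zero(square, delta), inf_sup_near_zero(wiggle, delta))
```

For $x^2$ the two numbers squeeze together toward 0 as delta shrinks, while for $\sin(1/x)$ they stay near $-1$ and $1$, so no single value is pinned down between them.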
Chapter 7


Limits: Properties 


In this chapter we explore the concept of limit and get familiar with some 
basic properties that make the computation of limits a routine process for 
many ordinary functions. 


7.1 Up and down with limits 


Suppose f is a function on $\mathbb{R}$ and
\[ \lim_{x \to 3} f(x) = 5. \]
Our intuition suggests that when x is near enough to 3 (excluding x = 3 itself)
the value f(x) should be > 4. Let us see how we can deduce this from the
official definition of limit.

Since $f(x) \to 5$ as $x \to 3$, the number 5 is the unique value that lies
between the sups and infs of the values of f(x) for x (excluding x = 3) in
any neighborhood of 3. Thus, for instance, 4 does not lie between these sups
and infs. This means that there is some neighborhood, say U, of 3 such that
4 is not between the inf and the sup of f over U, excluding f(3):
\[ 4 \notin \Big[ \inf_{x \in U,\, x \ne 3} f(x),\ \sup_{x \in U,\, x \ne 3} f(x) \Big]. \]
Note that this interval does contain 5 because that is actually the limit. So
4 must lie below this interval:
\[ 4 < \inf_{x \in U,\, x \ne 3} f(x). \]
This means that all the values of f on the neighborhood U of 3, excluding f(3)
itself, are > 4. This is exactly what we had conjectured based on common
sense intuition about limits.

If you look over the preceding discussion you see that what makes the
argument work is simply that 4 is a value that is less than the limit 5. Thus
what we have really proved is this:


Proposition 7.1.1 Suppose f is a function on some subset $S \subset \mathbb{R}$, and
\[ L = \lim_{x \to p} f(x), \]
where $p \in \mathbb{R}^*$. If b is any value below L, that is b < L, then there is a
neighborhood U of p on which
\[ f(x) > b \quad \text{for all } x \in U \cap S \text{ except possibly for } x = p. \]

We have had to write $x \in U \cap S$, and not just $x \in U$, because f(x) might
not be defined for all x in U.

It is important not to get bogged down in the notation used: keep in
mind the essence of the idea. What we are saying is, in ordinary rough and
ready language, if $f(x) \to L$ as $x \to p$ then the values f(x) lie above b when
x is near p (but not p itself), for any given value b < L.

Of course, we can do the same for values above the limit:

Proposition 7.1.2 Suppose f is a function on some subset $S \subset \mathbb{R}$, and
\[ L = \lim_{x \to p} f(x), \]
where $p \in \mathbb{R}^*$. If u is any value > L then there is a neighborhood U of p on
which
\[ f(x) < u \quad \text{for all } x \in U \cap S \text{ except possibly for } x = p. \]

We can even put these two observations together:

Proposition 7.1.3 Suppose f is a function on some subset $S \subset \mathbb{R}$, and
\[ L = \lim_{x \to p} f(x), \]
where $p \in \mathbb{R}^*$. If u is any value > L and b is any value < L then there is a
neighborhood U of p on which
\[ b < f(x) < u \quad \text{for all } x \in U \cap S \text{ except possibly for } x = p. \]
7.2 Limits: the standard definition

Proposition 7.1.3 can be recast and slightly broadened into the following
characterization of the notion of limit:

Proposition 7.2.1 Let f be a function on $S \subset \mathbb{R}$ and $p \in \mathbb{R}^*$. Suppose
$L = \lim_{x \to p} f(x)$ exists. Then for any neighborhood W of L there is a
neighborhood U of p such that
\[ f(x) \in W \quad \text{for all } x \in U \cap S, \text{ excluding } x = p. \]

Proof. Suppose first that L is a real number, and not $\infty$ or $-\infty$. Then in
the neighborhood W there is an open interval
\[ (b, u), \]
centered at L, where $b, u \in \mathbb{R}$ and b < u. Then by Proposition 7.1.3 there is
a neighborhood U of p for which
\[ b < f(x) < u \quad \text{for all } x \in U \cap S \text{ except possibly for } x = p. \]
Thus, for all $x \in U \cap S$, except x = p, the value f(x) lies in the interval (b, u)
and hence in W.
Now consider the case $L = \infty$. A neighborhood of L then contains an
interval of the form
\[ (b, \infty], \]
for some real number b. By Proposition 7.1.1 there is a neighborhood U of p
for which
\[ f(x) > b \quad \text{for all } x \in U \cap S \text{ except possibly for } x = p. \]
Thus, for all $x \in U \cap S$, except x = p, the value f(x) lies in the interval
$(b, \infty]$ and hence in W.
Lastly, consider the case $L = -\infty$. A neighborhood of L contains an
interval of the form
\[ [-\infty, u). \]
By Proposition 7.1.2 there is a neighborhood U of p for which
\[ f(x) < u \quad \text{for all } x \in U \cap S \text{ except possibly for } x = p. \]
Thus, for all $x \in U \cap S$, except x = p, the value f(x) lies in the interval
$[-\infty, u)$ and hence in W. QED


The result can also be run in reverse: 


60 Ambar N. Sengupta 11/6/2011 


Proposition 7.2.2 Let f be a function on $S \subset \mathbb{R}$ and let $p \in \mathbb{R}^*$ be a limit
point of S. Suppose $L \in \mathbb{R}^*$ has the property that for any neighborhood W of
L there is a neighborhood U of p such that
\[ f(x) \in W \quad \text{for all } x \in U \cap S, \text{ excluding } x = p. \tag{7.1} \]
Then $L = \lim_{x \to p} f(x)$.

Proof. Suppose first that L is a real number, and not $\infty$ or $-\infty$. We will
show that L is the unique value that lies between the sups and infs of f(x)
for x in all neighborhoods of p (excluding the value at x = p). Suppose
this were not true. Suppose U is a neighborhood of p such that
\[ L < \inf_{x \in U \cap S,\, x \ne p} f(x). \tag{7.2} \]
Pick any real number u between these two values:
\[ L < u < \inf_{x \in U \cap S,\, x \ne p} f(x). \tag{7.3} \]
Consider then the neighborhood of L given by
\[ W = (b, u), \]
where b < L and L is the center of the interval W. Then by the condition
(7.1) there is a neighborhood $U'$ of p for which
\[ b < f(x) < u \quad \text{for all } x \in U' \cap S \text{ except possibly for } x = p. \]
But this is impossible since, by (7.3), u < f(x) for all $x \in U \cap S$ with
$x \ne p$, and $U \cap U' \cap S$ does contain a point other than p (the assumption
that p is not an isolated point or an exterior point for S guarantees this,
because $U \cap U'$ is again a neighborhood of p). This
contradiction shows that (7.2) is false, and so
\[ \inf_{x \in U \cap S,\, x \ne p} f(x) \le L. \]
By a similar argument it also follows that
\[ L \le \sup_{x \in U \cap S,\, x \ne p} f(x). \]
Thus L lies between all the sups and infs for f as required.
But we still need to show that L is the unique such value. Consider any
value $L' \ne L$. Suppose first L' > L. Consider a neighborhood W = (b, u) of L
with u < L', so that W does not contain L' and L' is greater than all the values in W. By
(7.1) there is a neighborhood U of p such that $f(x) \in W$ for all $x \in U \cap S$
with $x \ne p$. Hence for all such values x we have f(x) < u < L', and so L' lies
strictly above $\sup_{x \in U \cap S,\, x \ne p} f(x)$. Thus L' cannot be the limit of f(x) as $x \to p$.
By just a similar argument no value < L could be the limit of f(x) as $x \to p$.

The cases $L = \infty$ and $L = -\infty$ are settled by different but similar
arguments, just keeping in mind that the neighborhoods of $\infty$ and $-\infty$ are
‘one sided’ rays. QED

Because of the preceding two results the definition of limit has the following
equivalent formulation (the standard one):

Definition 7.2.1 Let f be a function on a set $S \subset \mathbb{R}$, and p be any limit
point of S. A value $L \in \mathbb{R}^*$ is said to be the limit of f(x) as $x \to p$ if for any
neighborhood W of L there is a neighborhood U of p such that $f(x) \in W$ for
all $x \in U \cap S$ with $x \ne p$.
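To see the standard definition at work on a concrete case, take $f(x) = x^2$, p = 3 and L = 9, an example chosen here for illustration. Given the neighborhood $W = (9 - \varepsilon, 9 + \varepsilon)$ of 9, the neighborhood $U = (3 - \delta, 3 + \delta)$ with $\delta = \min(1, \varepsilon/7)$ should do the job, because $|x - 3| < 1$ forces $|x + 3| < 7$. The Python sketch below spot-checks this on a grid; the names `delta_for` and `check` are hypothetical, and a grid check is evidence rather than proof.

```python
def delta_for(eps):
    # For f(x) = x**2 near p = 3: if |x - 3| < 1 then |x + 3| < 7, so
    # |x**2 - 9| = |x - 3| * |x + 3| < 7 * |x - 3|.
    return min(1.0, eps / 7.0)

def check(eps, n=1000):
    # Test f(x) in W = (9 - eps, 9 + eps) for sampled x in U with x != 3.
    delta = delta_for(eps)
    xs = [3 - delta + 2 * delta * k / n for k in range(1, n)]
    return all(abs(x * x - 9) < eps for x in xs if x != 3)

print([check(eps) for eps in (5.0, 1.0, 0.1, 1e-3)])
```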


7.3 Limits: working rules 


It is an exhausting and largely pointless task to try to use the definition of 
limit directly in computing actual limits such as 


It is far more efficient to develop some working rules, basic results, using 
which more complicated limits can be reduced to simpler ones and then 
these worked out directly. 

There really are just two limits we have worked out directly from the
definition:
\[ \lim_{x \to p} k = k, \qquad \lim_{x \to p} x = p, \tag{7.4} \]
for all constants $k \in \mathbb{R}$ and points $p \in \mathbb{R}^*$.
The first computationally useful result for limits is simply that the limit
of a sum is the sum of the limits:
\[ \lim_{x \to p} [f(x) + g(x)] = \lim_{x \to p} f(x) + \lim_{x \to p} g(x). \tag{7.5} \]
There are some qualifiers: we need to assume that the two limits on the right
exist and that the sum on the right is defined (it isn’t of the form $(-\infty) + \infty$
or $\infty + (-\infty)$). Here is a formal statement:

Proposition 7.3.1 Suppose f and g are functions on $S \subset \mathbb{R}$, and suppose
the limits $\lim_{x \to p} f(x)$ and $\lim_{x \to p} g(x)$ exist, where p is some point in $\mathbb{R}^*$.
Assume furthermore that
\[ \{\lim_{x \to p} f(x),\ \lim_{x \to p} g(x)\} \ne \{\infty, -\infty\}. \tag{7.6} \]
Then $\lim_{x \to p} [f(x) + g(x)]$ exists and
\[ \lim_{x \to p} [f(x) + g(x)] = \lim_{x \to p} f(x) + \lim_{x \to p} g(x). \tag{7.7} \]

The point of the condition (7.6) is to ensure that the sum on the right in
(7.7) exists.

We will not prove this result here. Instead we march on to the next result, 
focused on multiplication: 


Proposition 7.3.2 Suppose f and g are functions on $S \subset \mathbb{R}$, and p is some
point in $\mathbb{R}^*$. Then the limit $\lim_{x \to p} [f(x) g(x)]$ exists and
\[ \lim_{x \to p} [f(x) g(x)] = \Big( \lim_{x \to p} f(x) \Big) \Big( \lim_{x \to p} g(x) \Big) \tag{7.8} \]
provided the two limits on the right and their product exist and this product
is not of the form $0 \cdot (\pm\infty)$ or $(\pm\infty) \cdot 0$. More clearly, the condition is that
the limits $\lim_{x \to p} f(x)$ and $\lim_{x \to p} g(x)$ exist and
\[ \{\lim_{x \to p} f(x),\ \lim_{x \to p} g(x)\} \ne \{0, \infty\} \quad \text{and} \quad \{\lim_{x \to p} f(x),\ \lim_{x \to p} g(x)\} \ne \{0, -\infty\}. \tag{7.9} \]

It would have been easier to state this result had we never defined $0 \cdot \infty$ and
$\infty \cdot 0$ as 0, and instead left such products as undefined. But the convention
$0 \cdot \infty = 0$ is very convenient for integration theory and so we hold on to it.
Here is a quick application of the preceding rules about limits: we can
compute the limit of $x^2 + 3x + 4$ as $x \to 1$:
\[
\begin{aligned}
\lim_{x \to 1} (x^2 + 3x + 4) &= \lim_{x \to 1} x^2 + \lim_{x \to 1} (3x + 4) && \text{(if this exists)} \\
&= \Big(\lim_{x \to 1} x\Big)\Big(\lim_{x \to 1} x\Big) + \Big[\Big(\lim_{x \to 1} 3\Big)\Big(\lim_{x \to 1} x\Big) + \lim_{x \to 1} 4\Big] && \text{(if this exists)} \\
&= 1 \cdot 1 + [3 \cdot 1 + 4] && \text{(and this does exist!)} \\
&= 8.
\end{aligned}
\tag{7.10}
\]
This is the kind of reasoning one should go through once but it is clearly so
routine that it is not worth mentioning all the steps every time. For
\[ \lim_{x \to 2} (x^3 + 5x - 2) = 8 + 10 - 2, \]
not only is the result (the value of the limit) perfectly obvious from common
sense, it is also perfectly obvious how to use the rules of limits to actually
prove that the limit is indeed the value stated above.

Going beyond multiplication we consider ratios. Note that a ratio
\[ \frac{a}{b} \]
is not meaningful if the denominator b is 0 or if a and b are both $\pm\infty$. It is
useful, for the purposes of the next result, to define
\[ \frac{a}{\infty} = 0 = \frac{a}{-\infty} \quad \text{for every real number } a. \tag{7.11} \]
OO —CO 


Proposition 7.3.3 Suppose f and g are functions on $S \subset \mathbb{R}$, and p is some
point in $\mathbb{R}^*$. Then the limit $\lim_{x \to p} \frac{f(x)}{g(x)}$ exists and
\[ \lim_{x \to p} \frac{f(x)}{g(x)} = \frac{\lim_{x \to p} f(x)}{\lim_{x \to p} g(x)}, \tag{7.12} \]
provided the two limits on the right and their ratio exist (this means that the
ratio must not look like something/0 or $\pm\infty / \pm\infty$).


As a simple illustration of what can go wrong in using this let us look at
\[ \lim_{x \to 3} \frac{x^2 - 9}{x - 3}. \]
If we just do the ratio of the limits we end up with 0/0, and this is just a
case where the preceding result cannot be applied. Thus we need to be less
lazy and observe that
\[ \frac{x^2 - 9}{x - 3} = \frac{(x - 3)(x + 3)}{x - 3} = x + 3 \quad \text{for } x \ne 3, \]
from which it is clear that
\[ \lim_{x \to 3} \frac{x^2 - 9}{x - 3} = 6. \]
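A quick numerical look at the ratio $(x^2 - 9)/(x - 3)$ near x = 3 agrees with the factorization: the values track x + 3 and settle toward 6, even though the expression itself reads 0/0 at x = 3. The Python sketch below is only an illustration; the function name `ratio` and the step sizes are chosen here.

```python
def ratio(x):
    # (x**2 - 9) / (x - 3); not defined at x = 3 itself, where it reads 0/0.
    return (x * x - 9.0) / (x - 3.0)

for h in (0.1, 0.01, 0.001, 1e-6):
    print(h, ratio(3 - h), ratio(3 + h))
```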


7.4 Limits by comparing 


Sometimes we can find the limit of a function by comparing it with other 
functions that are easier to understand. 

The so called ‘squeeze theorem’ is a case of this. Suppose f, g, and h are
functions on a set $S \subset \mathbb{R}$ and $p \in \mathbb{R}^*$ is such that
\[ f(x) \le h(x) \le g(x) \tag{7.13} \]
for all x in S that lie in some neighborhood U of p, excluding x = p. Assume
that $\lim_{x \to p} f(x)$ and $\lim_{x \to p} g(x)$ exist and are equal:
\[ L = \lim_{x \to p} f(x) = \lim_{x \to p} g(x). \]
Then h(x), squeezed in between f(x) and g(x), is forced to also approach
the same limit L.
Here is a formal statement and proof:

Proposition 7.4.1 Suppose f, g, and h are functions on a set $S \subset \mathbb{R}$, and
$p \in \mathbb{R}^*$ is such that
\[ f(x) \le h(x) \le g(x) \tag{7.14} \]
for all x in S that lie in some neighborhood of p, excluding x = p. Assume
also that $\lim_{x \to p} f(x)$ and $\lim_{x \to p} g(x)$ exist and are equal:
\[ L = \lim_{x \to p} f(x) = \lim_{x \to p} g(x). \]
Then $\lim_{x \to p} h(x)$ exists and is equal to L.
Proof. We will use the second formulation of the definition of limit, given in
Definition 7.2.1. (Since the limit $\lim_{x \to p} f(x)$ is assumed to exist, the point
p is a limit point of S.)

Consider any neighborhood W of L. Note that W is an interval containing
L.

Since $\lim_{x \to p} g(x) = L$, Definition 7.2.1 implies that there is a neighborhood
$U_1$ of p such that $g(x) \in W$ for all $x \in U_1 \cap S$ with $x \ne p$.

Similarly, there is also a neighborhood $U_2$ of p such that $f(x) \in W$ for all
$x \in U_2 \cap S$ with $x \ne p$. Consider then
\[ U = U_1 \cap U_2, \]
which is also a neighborhood of p, contained inside both $U_1$ and $U_2$; shrinking U
further if necessary, we may assume that (7.14) also holds for all $x \in U \cap S$
with $x \ne p$. If we take
any $x \in U \cap S$, with $x \ne p$, then both g(x) and f(x) lie inside W, and so
anything between g(x) and f(x) also lies in W. Hence
\[ h(x) \in W \]
for all $x \in U \cap S$, with $x \ne p$. This proves that
\[ \lim_{x \to p} h(x) = L. \qquad \text{QED} \]


Here is a consequence that is easier to apply sometimes: 


Proposition 7.4.2 Suppose f is a function on a set $S \subset \mathbb{R}$ and $p \in \mathbb{R}^*$ is
such that

\[
\lim_{x\to p} |f(x)| = 0.
\]

Then

\[
\lim_{x\to p} f(x) = 0.
\]

Note that we cannot draw any conclusion if we know that $|f(x)| \to 5$, or some
other nonzero value, because in that case f(x) could fluctuate up and down
between 5 and -5.

Proof. Any $a \in \mathbb{R}$ is equal either to |a| (if $a \ge 0$) or to -|a| (if a < 0), and so we can
certainly write

\[
-|a| \le a \le |a|.
\]

Hence

\[
-|f(x)| \le f(x) \le |f(x)|
\]

for all x in S. As $x \to p$ both $|f(x)| \to 0$ and $-|f(x)| \to 0$, and so, by the
‘squeeze’ theorem, $f(x) \to 0$ as well. QED
Here is a quick application:

\[
\lim_{x\to 0} x^3 \sin\left(x + \frac{1}{x}\right) = 0.
\]

This follows by using the fact that $|\sin(\cdot)|$ is $\le 1$, which shows that

\[
0 \le \left| x^3 \sin\left(x + \frac{1}{x}\right) \right| \le |x|^3,
\]

to which we can apply Proposition 7.4.2.


7.5 Limits of composite functions 
Suppose f and g are functions. The composite $f \circ g$ is specified by

\[
(f \circ g)(x) = f(g(x)),
\]

and its domain is the set of all x for which this exists.
For example,

\[
x \mapsto \sqrt{1 - x^2}
\]

is the composite of the function $x \mapsto 1 - x^2$ (for all $x \in \mathbb{R}$) and the function
$u \mapsto \sqrt{u}$ (for $u \ge 0$); its domain is [-1, 1].
Turning to limits, it seems clear that

if $g(x) \to q$, as $x \to p$, and $f(v) \to L$, as $v \to q$,

we should be able to conclude that

\[
f(g(x)) \to L \quad\text{as } x \to p.
\]

This is an extremely useful method and mostly we use it without even noticing;
for example, using the simple result

\[
\lim_{w\to 1}\frac{w^3 - 1}{w - 1} = \lim_{w\to 1}(w^2 + w + 1) = 3,
\]
we obtain a limit that is at first less obvious:

\[
\lim_{x\to 1}\frac{x - 1}{x^{1/3} - 1} = 3,
\]

by using the ‘substitution’ $x = w^3$.
This type of reasoning can encounter a rare breakdown. As an extreme
example, consider the functions F and G given by

\[
F(v) = \begin{cases} 1 & \text{if } v \neq 0;\\ 2 & \text{if } v = 0, \end{cases}
\]

and G(x) = 0 for all x. Then

$G(x) \to 0$ as $x \to 0$, and $F(v) \to 1$ as $v \to 0$,

but F(G(x)) is stuck at the value 2 and so

$F(G(x)) \not\to 1$ as $x \to 0$.

What has gone wrong is that we have rigged the inner function G to keep
hitting (in fact it is stuck at) the ‘forbidden’ point v = 0 which is excluded
from consideration when defining $\lim_{v\to 0} F(v)$.
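The breakdown can be seen concretely in a small Python sketch. It assumes the reading of the example in which F takes the value 1 away from 0 and the value 2 at 0, which is what the statements "F(v) tends to 1" and "F(G(x)) is stuck at 2" require; the sample points are arbitrary.

```python
def F(v):
    # F is 1 everywhere except at the 'forbidden' point v = 0, where it is 2
    return 2 if v == 0 else 1

def G(x):
    # G is stuck at the value 0
    return 0

near_zero = [0.1, -0.01, 0.001, -0.0001]
print([F(v) for v in near_zero])     # values of F(v) for v near 0, v != 0
print([F(G(x)) for x in near_zero])  # values of F(G(x)) for x near 0
```

The first list reflects the limit of F at 0, while the second shows the composite sitting at F(0), a value the limit never looks at.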

Treading around this obstacle we can formulate the composite limit very
delicately in the following result. Recall that a point p is exterior to a set B
if p has a neighborhood lying entirely outside B.


Proposition 7.5.1 Let f and g be functions, defined on subsets of $\mathbb{R}$, and
suppose

\[
\lim_{x\to p} g(x) = q \quad\text{and}\quad \lim_{v\to q} f(v) = L.
\]

Let S be the domain of the composite $f \circ g$. Assume that:

(i) p is a limit point of S;

(ii) p is exterior to the set $\{x : x \neq p,\ g(x) = q\} \cap S$. (In other words, p
has a neighborhood $U_0$ with the property that there is no point in $U_0$,
other than p itself, that is both in S and where g takes the
value q.)

Then $\lim_{x\to p} f(g(x))$ exists and is equal to L:

\[
\lim_{x\to p} (f \circ g)(x) = L.
\]

Condition (ii) ensures that g(x) avoids the ‘forbidden’ value q for all x that
are used in determining $\lim_{x\to p} f(g(x))$. Notice that (ii) is automatically
satisfied in the case $q = \pm\infty$, for g, being a real valued function, never takes
the value $\pm\infty$. Another very convenient special case is when g(x) simply does
not take the value q, except possibly when x = p.

Proof. Let us begin by recalling what it means for $f(v) \to L$ as $v \to q$. Let
W be any neighborhood of L. Then there is a neighborhood V of q such that

$f(v) \in W$ for all $v \in V$, with $v \neq q$, that are in the domain of f. (7.15)

Next, since $g(x) \to q$ as $x \to p$, there is a neighborhood U of p such that

$g(x) \in V$ for all $x \in U$, with $x \neq p$, that are in the domain of g. (7.16)

We will focus on the neighborhood U of p shrunk down by intersecting
with $U_0$:

\[
U_1 = U \cap U_0,
\]

which is still a neighborhood of p, of course.
Now consider any point x in $U_1$, with $x \neq p$, for which f(g(x)) is defined;
such points exist because p is a limit point of S, and for each of them $g(x) \neq q$
by condition (ii). By (7.16) we have $g(x) \in V$,
and then by (7.15), we have $f(g(x)) \in W$.
Thus, starting with any neighborhood W of L we have produced a neighborhood
$U_1$ of p such that $(f \circ g)(x) \in W$ for all $x \in U_1 \cap S$ with $x \neq p$. QED


Chapter 8 


Trigonometric Functions 


We consider the trigonometric functions sin and cos. Though we don’t really 
discuss completely precise mathematical definitions for these functions, we 
extract enough information about them from trigonometry to be able to do 
calculus with these functions. Eventually one can use the results of calculus 
to construct definitions for sin and cos that don’t use geometry. 


8.1 Measuring angles 


What exactly is an angle? The most basic idea of an angle is that it is 
specified by two rays going out of a given vertex point. 


Figure 8.1: Angle as a pair of rays


This leaves open a small bit of ambiguity, as to whether we are thinking 
of the ‘smaller’ angle or the remaining ‘larger’ angle. 

One way to be more specific is to draw a circle, with center C at the
vertex, and think of the angle as an arc of the circle marked off by the two
rays. To be more precise we could just think of a circle of radius 1, with
center C at the vertex of the two rays, and then think of the angle as a
sector marked off in the circle by the two rays.

Figure 8.2: Measuring angle using sectorial areas. The area of the shaded sector
gives a measure of the angle $\angle PCQ$ (taking the radius CP to be the unit of length).

Then the radian measure of the angle $\angle PCQ$ is taken to be twice the area
of the sector PCQ.

Why twice? This is just to be consistent with historical practice and
convention. Take, as an extreme example, the full angle, so that the sector
PCQ is, in fact, the entire circular disk. The area of this disk is what is
denoted

\[
\pi,
\]

and so the full circular angle has radian measure $2\pi$.
From this we can quickly see that 90°, which specifies a quarter circle,
has radian measure

\[
\frac{1}{4}(2\pi) = \frac{1}{2}\pi.
\]

This discussion has one element of haziness: what do we mean by the
area of a curved region? For this please turn back to the Introduction.

8.2 Geometric specification of sin, cos and tan


We will describe the geometric meaning of the measure of an angle and also
that of $\sin\theta$ and $\cos\theta$.
Regardless of how an angle might be measured, the geometric meanings
of sin, cos and tan of an acute angle are illustrated in the classical diagram
shown in Figure 8.3. If the angle is specified by a pair of rays $R_1$ and
$R_2$, initiating from a vertex C, we draw a circle, with center C, and take
the radius to be the unit of length. The ‘semi-chord’ from $R_2$ to $R_1$ is the
segment, perpendicular to $R_1$, that runs from the point Q where $R_2$ cuts the
circle to a point on $R_1$. The length of the ‘semi-chord’ is the sin of the angle.
The cos of the angle is the distance from the vertex C to the semi-chord.
Then tan of the angle is the length of the segment, tangent to the circle at Q,
that runs from Q to a point on $R_1$.


Figure 8.3: Classical definitions of sin, cos, and tan 


The more cluttered Figure 8.4 provides more concrete formulas and also
relates visually to the measurement of the angle $\theta$ in terms of the area of the
sectorial region it cuts out of the circle.

The line through P perpendicular to CQ intersects the line CQ at a point
B. Let

\[
x = CB, \qquad y = BP. \tag{8.1}
\]

Here we take x to be negative if B is on the opposite side of C from P. We
take y to be negative if $\theta > \pi$.

Figure 8.4: Measuring $\theta$ and $\sin\theta$, $\cos\theta$, and $\tan\theta$

The trigonometric functions are specified by:

\[
\begin{aligned}
\sin\theta &= \frac{BP}{CP};\\
\cos\theta &= \frac{CB}{CP};\\
\tan\theta &= \frac{BP}{CB},
\end{aligned}
\tag{8.2}
\]

where BP and CB are taken with the signs of y and x in (8.1), and where we leave
$\tan\theta$ undefined if $\theta = \pi/2$ because in this case the denominator
CB becomes 0.

From (8.2) it is also clear that

\[
\tan\theta = \frac{\sin\theta}{\cos\theta} \tag{8.3}
\]

as long as $\theta \neq \pi/2$.


Geometrically, to be consistent with the preceding discussion, it is sensible
to define

\[
\begin{aligned}
\sin 0 &= 0;\\
\cos 0 &= 1;\\
\tan 0 &= 0.
\end{aligned}
\tag{8.4}
\]

Observe also that

\[
\begin{aligned}
\sin \pi/2 &= 1;\\
\cos \pi/2 &= 0,
\end{aligned}
\tag{8.5}
\]

and

\[
\begin{aligned}
\sin \pi &= 0;\\
\cos \pi &= -1;\\
\tan \pi &= 0.
\end{aligned}
\tag{8.6}
\]


If the angle $\theta$ is increased to $\theta + 2\pi$ then geometrically the point Q remains
where it is. Thus we can define the trigonometric functions for values outside
$[0, 2\pi)$ by requiring that

\[
\begin{aligned}
\sin(a + 2\pi) &= \sin a;\\
\cos(a + 2\pi) &= \cos a;\\
\tan(a + 2\pi) &= \tan a,
\end{aligned}
\tag{8.7}
\]

again as long as $\tan(a + 2\pi)$, and hence $\tan a$, is defined.

The above properties of the trigonometric functions are summarized in
words by saying that these functions are periodic: the values repeat every time
a is changed to $a + 2\pi$. For sin and cos, $2\pi$ is the least
positive value with this property, and so these functions have period $2\pi$; for tan
the values in fact already repeat when a is changed to $a + \pi$.

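When $a \in (0, \pi)$, $\sin a$ is positive, and when $a \in (\pi, 2\pi)$ the value $\sin a$ is
negative:

\[
\sin a \begin{cases} > 0 & \text{if } a \in (0, \pi);\\ < 0 & \text{if } a \in (\pi, 2\pi). \end{cases}
\tag{8.8}
\]

For cos it is:

\[
\cos a \begin{cases} > 0 & \text{if } a \in (-\pi/2, \pi/2);\\ < 0 & \text{if } a \in (\pi/2, 3\pi/2). \end{cases}
\tag{8.9}
\]


8.3 Reciprocals of sin, cos, and tan


The reciprocals of sin, cos and tan also have names:

\[
\begin{aligned}
\csc\theta &= \frac{1}{\sin\theta};\\
\sec\theta &= \frac{1}{\cos\theta};\\
\cot\theta &= \frac{1}{\tan\theta},
\end{aligned}
\tag{8.10}
\]

whenever these reciprocals are meaningful (for instance, $\csc 0$ and $\sec(\pi/2)$
are undefined).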


8.4 Identities 


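If one angle of a right-angled triangle is $\theta$ then the other is $\pi/2 - \theta$. This
leads to the following identities:

\[
\begin{aligned}
\sin\left(\frac{\pi}{2} - \theta\right) &= \cos\theta;\\
\cos\left(\frac{\pi}{2} - \theta\right) &= \sin\theta;\\
\tan\left(\frac{\pi}{2} - \theta\right) &= \cot\theta.
\end{aligned}
\tag{8.11}
\]
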
When an angle is replaced by its negative, it changes the sign of sin and 
tan but not of cos: 


sin(—a) = — sina; 
cos(—a) = cos a; (8.12) 
tan(—a) = — tana, 


with the last holding if the tan values exist. 
Pythagoras’ theorem implies the enormously useful identity

\[
\sin^2 a + \cos^2 a = 1, \tag{8.13}
\]

for all $a \in \mathbb{R}$. Using this we can work out the value of sin, at least up to sign,
from the value of cos:

\[
\sin a = \pm\sqrt{1 - \cos^2 a}. \tag{8.14}
\]

The only way to decide whether it is + or whether it is - is to consider the
value of a: if $a \in [0, \pi]$, or differs from such a value by an integer multiple of
$2\pi$, then $\sin a$ is $\ge 0$.

Similar considerations hold for

\[
\cos a = \pm\sqrt{1 - \sin^2 a}. \tag{8.15}
\]


There are several relations among these reciprocal trigonometric functions
that can be deduced from relations between sin, cos, and tan. For instance,
dividing

\[
\sin^2\theta + \cos^2\theta = 1
\]

by $\cos^2\theta$ produces:

\[
\frac{\sin^2\theta}{\cos^2\theta} + 1 = \frac{1}{\cos^2\theta},
\]

which can be rewritten as

\[
1 + \tan^2\theta = \sec^2\theta \tag{8.16}
\]

for all $\theta$ for which $\tan\theta$ is defined. (For $\theta = \pm\pi/2$ one could define $\tan^2\theta$ as
well as $\sec^2\theta$ both to be $\infty$, and similarly for all the other trouble spots $\pm\pi/2$
plus integer multiples of $2\pi$, and this would make (8.16) valid for all $\theta \in \mathbb{R}$.)

Very clever geometric arguments can be used to prove the trigonometric
identities:

\[
\begin{aligned}
\sin(a + b) &= \sin a\cos b + \sin b\cos a;\\
\cos(a + b) &= \cos a\cos b - \sin a\sin b;\\
\tan(a + b) &= \frac{\tan a + \tan b}{1 - \tan a\tan b},
\end{aligned}
\tag{8.17}
\]

where for the last identity we require, of course, that the tan values are
actually defined.
There are some consequences of these addition formulas that are also
useful. The simplest are obtained by taking b = a in the preceding formulas;
these special cases are themselves very useful:

\[
\begin{aligned}
\sin(2a) &= 2\sin a\cos a;\\
\cos(2a) &= \cos^2 a - \sin^2 a = 2\cos^2 a - 1 = 1 - 2\sin^2 a;\\
\tan(2a) &= \frac{2\tan a}{1 - \tan^2 a},
\end{aligned}
\tag{8.18}
\]

as long as the tan values are defined.
Now consider

\[
\sin x - \sin y.
\]

Suppose we choose a and b such that

\[
\begin{aligned}
x &= a + b\\
y &= a - b.
\end{aligned}
\tag{8.19}
\]

Then

\[
\begin{aligned}
\sin x - \sin y &= \sin(a + b) - \sin(a - b)\\
&= (\sin a\cos b + \sin b\cos a) - (\sin a\cos b - \sin b\cos a)\\
&= 2\sin b\cos a.
\end{aligned}
\tag{8.20}
\]

Now we need to substitute in the values of a and b in terms of x and y by
solving (8.19). Adding the equations gives

\[
x + y = 2a,
\]

and subtracting gives

\[
x - y = (a + b) - (a - b) = 2b.
\]

Thus

\[
\begin{aligned}
a &= \frac{1}{2}(x + y)\\
b &= \frac{1}{2}(x - y).
\end{aligned}
\tag{8.21}
\]

Putting these into (8.20) produces

\[
\sin x - \sin y = 2\sin\frac{x - y}{2}\cos\frac{x + y}{2}. \tag{8.22}
\]

Following the same line of reasoning for cos instead of sin gives us

\[
\cos x - \cos y = -2\sin\frac{x - y}{2}\sin\frac{x + y}{2}. \tag{8.23}
\]

Before moving on, let us note that taking -b in place of b in (8.17) gives

\[
\begin{aligned}
\sin(a - b) &= \sin a\cos b - \sin b\cos a;\\
\cos(a - b) &= \cos a\cos b + \sin a\sin b;\\
\tan(a - b) &= \frac{\tan a - \tan b}{1 + \tan a\tan b}.
\end{aligned}
\tag{8.24}
\]
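Identities of this kind are easy to spot-check numerically. The following Python sketch (the helper name `check` and the sample pairs are illustrative choices, not part of the text) evaluates the difference between the two sides of the formulas for sin x - sin y and cos x - cos y at a few points; the differences should be zero up to rounding error.

```python
import math

def check(x, y):
    # differences between the left and right sides of the two identities
    d_sin = (math.sin(x) - math.sin(y)) \
        - 2 * math.sin((x - y) / 2) * math.cos((x + y) / 2)
    d_cos = (math.cos(x) - math.cos(y)) \
        + 2 * math.sin((x - y) / 2) * math.sin((x + y) / 2)
    return d_sin, d_cos

for x, y in [(0.3, 1.2), (2.0, -0.7), (5.5, 4.1)]:
    print(x, y, check(x, y))
```

A check like this does not prove an identity, but it is a cheap way to catch a misplaced sign or a swapped sin and cos.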

8.5 Inequalities


The identity

\[
\sin^2 a + \cos^2 a = 1
\]

implies that neither $\sin a$ nor $\cos a$ can be bigger than 1 in magnitude:

\[
\begin{aligned}
|\sin a| &\le 1;\\
|\cos a| &\le 1.
\end{aligned}
\tag{8.25}
\]

Here |a| denotes the magnitude of a. For example,

|5| = 5; |0| = 0; and |-4| = 4.

Note, however, that both $\sin a$ and $\cos a$ do reach the values 1 and -1
repeatedly no matter how far away from 0 the value of a is:

\[
\begin{aligned}
\sin\left(\frac{\pi}{2} + 2\pi n\right) &= 1;\\
\sin\left(-\frac{\pi}{2} + 2\pi n\right) &= -1,
\end{aligned}
\tag{8.26}
\]

for all integers $n \in \mathbb{Z}$. The same holds for cos:

\[
\begin{aligned}
\cos(2\pi n) &= 1;\\
\cos(\pi + 2\pi n) &= -1,
\end{aligned}
\tag{8.27}
\]

for all integers $n \in \mathbb{Z}$.
Some geometric arguments with areas show that

\[
\cos x < \frac{\sin x}{x} < 1 \quad\text{for all } x \in (0, \pi/2]. \tag{8.28}
\]

Since $\sin(-x)$ is $-\sin x$, we have

\[
\frac{\sin(-x)}{-x} = \frac{\sin x}{x}.
\]

Moreover, we also know that

\[
\cos(-x) = \cos x.
\]

Thus we can go over to the negative side as well:

\[
\cos x < \frac{\sin x}{x} < 1 \quad\text{for all } x \in (-\pi/2, \pi/2) \text{ with } x \neq 0. \tag{8.29}
\]

8.6 Limits for sin and cos


The functions sin and cos are continuous functions in the sense that their
limits coincide with their values:

\[
\begin{aligned}
\lim_{x\to p}\sin x &= \sin p;\\
\lim_{x\to p}\cos x &= \cos p,
\end{aligned}
\tag{8.30}
\]

for all $p \in \mathbb{R}$. (We will return to the notion of continuous functions later.)
In particular,

\[
\lim_{x\to 0}\sin x = \sin 0 = 0
\]

and

\[
\lim_{x\to 0}\cos x = \cos 0 = 1.
\]

We can explore the behavior of $\sin x$ for x near 0 more carefully. Recall
the bounds for $(\sin x)/x$ near 0:

\[
\cos x < \frac{\sin x}{x} < 1 \quad\text{for all } x \in (-\pi/2, \pi/2) \text{ with } x \neq 0. \tag{8.31}
\]

We know that as $x \to 0$ we have $\cos x \to 1$. So $(\sin x)/x$, being between 1
and $\cos x$, also goes to the limit 1, by the squeeze theorem:

\[
\lim_{x\to 0}\frac{\sin x}{x} = 1. \tag{8.32}
\]
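The squeeze behind this limit can be watched numerically. The Python sketch below (the helper name `bounds_hold` and the sample points are illustrative choices) tabulates cos x and (sin x)/x for a few nonzero x in (-π/2, π/2) and checks that the ratio is trapped between cos x and 1.

```python
import math

def bounds_hold(x):
    # checks cos x < (sin x)/x < 1 for a nonzero x in (-pi/2, pi/2)
    r = math.sin(x) / x
    return math.cos(x) < r < 1

for x in [1.0, 0.5, 0.1, 0.01, -0.01, -0.5]:
    print(x, math.cos(x), math.sin(x) / x, bounds_hold(x))
```

As x shrinks, both printed columns crowd toward 1, which is exactly the squeeze at work.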
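There are many useful and not-so-useful consequences of this limit. For
instance,

\[
\begin{aligned}
\lim_{x\to 0}\frac{1 - \cos x}{x^2} &= \lim_{x\to 0}\frac{(1 - \cos x)(1 + \cos x)}{x^2(1 + \cos x)}\\
&= \lim_{x\to 0}\frac{1 - \cos^2 x}{x^2(1 + \cos x)}\\
&= \lim_{x\to 0}\left(\frac{\sin x}{x}\right)^2\frac{1}{1 + \cos x}\\
&= 1^2\cdot\frac{1}{1 + 1}.
\end{aligned}
\tag{8.33}
\]

In summary

\[
\lim_{x\to 0}\frac{1 - \cos x}{x^2} = \frac{1}{2}. \tag{8.34}
\]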
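As a sanity check on the value 1/2, here is a short Python sketch (the name `quotient` is an illustrative choice). It deliberately stops at moderately small x, because for extremely small x the subtraction 1 - cos x loses accuracy in floating-point arithmetic.

```python
import math

def quotient(x):
    # (1 - cos x) / x^2 for x != 0
    return (1 - math.cos(x)) / x**2

for x in [1.0, 0.1, 0.01, 0.001]:
    print(x, quotient(x))
```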
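Another very simple consequence of (8.32) is

\[
\lim_{x\to 0}\frac{\sin Kx}{x} = K, \quad\text{for all } K \in \mathbb{R}. \tag{8.35}
\]

This is clear if K = 0, since then both sides are 0. For $K \neq 0$ it can be seen by
observing that we can write

\[
\frac{\sin Kx}{x} = K\,\frac{\sin Kx}{Kx} = K\,\frac{\sin y}{y},
\]

where y = Kx. Clearly, y approaches 0 when $x \to 0$, and so we have

\[
\lim_{x\to 0}\frac{\sin Kx}{x} = \lim_{y\to 0} K\,\frac{\sin y}{y} = K\cdot 1 = K.
\]

(To be honest, some reasoning is needed here to explain why one can pass
from $x \to 0$ to $y \to 0$.)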


8.7 Limits with sin(1/z) 


The functions $\sin(1/x)$ and $\cos(1/x)$ are badly behaved functions near the
value x = 0.
When $x = 1/(\pi/2 + 2\pi n)$, for any integer n, we have $\sin(1/x) = 1$:

\[
\sin\left(\frac{1}{1/(\pi/2 + 2\pi n)}\right) = \sin(\pi/2 + 2\pi n) = 1 \quad\text{for all } n \in \mathbb{Z}.
\]

On the other hand, if $x = 1/(3\pi/2 + 2\pi n)$ then the value of $\sin(1/x)$ is -1:

\[
\sin\left(\frac{1}{1/(3\pi/2 + 2\pi n)}\right) = \sin(3\pi/2 + 2\pi n) = -1 \quad\text{for all } n \in \mathbb{Z}.
\]

Now any neighborhood of 0 contains the values $1/(\pi/2 + 2\pi n)$ and
$1/(3\pi/2 + 2\pi n)$ for large enough integers n. Thus, in every neighborhood of 0 there are
values of x for which $\sin(1/x)$ is 1 and there are values of x for which the value
of $\sin(1/x)$ is -1.

Thus

\[
\inf_{x\in U,\,x\neq 0}\sin(1/x) = -1, \quad\text{and}\quad \sup_{x\in U,\,x\neq 0}\sin(1/x) = 1 \tag{8.36}
\]

for every neighborhood U of 0. Thus there cannot be a unique value lying
between the sups and infs. Hence:

\[
\lim_{x\to 0}\sin\frac{1}{x} \text{ does not exist.} \tag{8.37}
\]


The problem here is that $\sin(1/x)$ fluctuates too much near x = 0. These
fluctuations can be dampened out by multiplying by x; consider

\[
f(x) = x\sin(1/x) \quad\text{for } x \neq 0.
\]

Since sin is at most 1 in magnitude we have

\[
-|x| \le f(x) \le |x| \quad\text{for all } x \neq 0.
\]

If we let $x \to 0$ then clearly $|x| \to 0$, and so the squeeze theorem implies that
$\lim_{x\to 0} f(x)$ exists and is 0:

\[
\lim_{x\to 0} x\sin\frac{1}{x} = 0. \tag{8.38}
\]

Notice that the ‘product rule’ does not work here:

\[
\lim_{x\to 0} x\sin\frac{1}{x} = \left(\lim_{x\to 0} x\right)\left(\lim_{x\to 0}\sin\frac{1}{x}\right) \quad\text{FAILS}
\]

because on the right the limit $\lim_{x\to 0}\sin\frac{1}{x}$ does not exist.
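Both phenomena, the wild oscillation of sin(1/x) and the damping effect of the factor x, show up in a small numerical experiment. In this Python sketch the helper names `peak` and `trough` are illustrative; they produce the special points 1/(π/2 + 2πn) and 1/(3π/2 + 2πn) used in the argument above.

```python
import math

def peak(n):
    # x = 1/(pi/2 + 2*pi*n), where sin(1/x) = 1
    return 1 / (math.pi / 2 + 2 * math.pi * n)

def trough(n):
    # x = 1/(3*pi/2 + 2*pi*n), where sin(1/x) = -1
    return 1 / (3 * math.pi / 2 + 2 * math.pi * n)

for n in [1, 10, 100, 1000]:
    for x in (peak(n), trough(n)):
        # sin(1/x) keeps hitting 1 and -1, but x*sin(1/x) is trapped by |x|
        print(x, math.sin(1 / x), x * math.sin(1 / x))
```

The middle column keeps jumping between values near 1 and -1 however small x gets, while the last column shrinks along with x.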


8.8 Graphs of trigonometric functions 


The graphs of sin and cos are waves, with sin passing through (0, 0) and cos
through (0, 1).
The graph for sin is

Figure 8.5: Graph of sin

The graph for tan blows up at $\pm\pi/2$, because

\[
\lim_{x\to \pi/2+}\tan x = -\infty, \quad\text{and}\quad \lim_{x\to \pi/2-}\tan x = \infty,
\]

and similarly at $-\pi/2$.

Figure 8.6: Graph of cos

Figure 8.7: Graph of tan


8.9 Postscript on trigonometric functions


Once one has built up the full apparatus of calculus, with both derivatives
and integrals, it is possible to reconstruct the functions sin, cos and tan
directly in terms of calculus, without reference to any diagrams or traditional
trigonometry. For example, here are the formulas for sin and cos that can be
used to define them without using pictures:

\[
\begin{aligned}
\sin x &= x - \frac{1}{3!}x^3 + \frac{1}{5!}x^5 - \cdots\\
\cos x &= 1 - \frac{1}{2!}x^2 + \frac{1}{4!}x^4 - \cdots
\end{aligned}
\tag{8.39}
\]

What the ‘infinite sums’ on the right mean exactly will be clear only after
we have studied limits and sequences. It is, at this stage, impossible to see
where the identities/definitions (8.39) come from.
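Even without knowing where these formulas come from, one can watch them work. The Python sketch below (the function names `sin_partial` and `cos_partial` are illustrative) adds up the first few terms of each sum at x = 1 and prints the results next to the library values of sin 1 and cos 1.

```python
import math

def sin_partial(x, terms):
    # sum of the first `terms` terms of x - x^3/3! + x^5/5! - ...
    return sum((-1)**k * x**(2 * k + 1) / math.factorial(2 * k + 1)
               for k in range(terms))

def cos_partial(x, terms):
    # sum of the first `terms` terms of 1 - x^2/2! + x^4/4! - ...
    return sum((-1)**k * x**(2 * k) / math.factorial(2 * k)
               for k in range(terms))

for terms in [1, 2, 3, 5, 10]:
    print(terms, sin_partial(1.0, terms), cos_partial(1.0, terms))
print(math.sin(1.0), math.cos(1.0))
```

The agreement improves rapidly as more terms are included, which hints at what the infinite sums are meant to express.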
Exercises on Limits


Write out the limits or explain as needed: 
1. 


2: 


. lim, 
 litig-ses 
Siis 2.8 
iitieeee 


sd Wie Seat 


. img 500 


«mig 334 


: limg_5o 


. limg_,x/6 


lim,515 


lim, ,1(2? + 4¢ — 2) 


lima—+1 =a 


lim,-+ 


1 g2—1 


el a 


7x> +a-+cos(x3) 
2x>—5a2+1 


» AE 45 [Vx +1— Va] 
» limg soo [V3a? + 1 - V2? +1 
| 


. limgso0 | W404 +2 -— Vart+1 


Vaet+l 
Wr 


Si oo ez 4+2—Vax2 + 1] 


. limg soo Vt $2 [Ve +1 - Vo] 


sin(6?) 
62 


sin? (0) 
62 


sin(@—7/6) 
6-7/6 
20. $\lim_{x\to 0} x\,1_{\mathbb{Q}}(x)$

21. $\lim_{x\to 0} x(1 - x)\,1_{\mathbb{Q}}(x)$

22. $\lim_{x\to 1} x(1 - x)\,1_{\mathbb{Q}}(x)$

23. Explain why $\lim_{x\to 3}(x - 1)1_{\mathbb{Q}}(x)$ does not exist.

24. Explain why $\lim_{x\to\infty}\cos x$ does not exist.

25. Explain why $\lim_{x\to\infty} x\sin x$ does not exist.

26. Explain why lim,_,.. att =.().

27. Explain why lim,-.. al = 0.
Chapter 9


Continuity


Continuous functions are functions that respect topological structure. They
are also the easiest to work with and are therefore the most suitable in applications.


9.1 Continuity at a point 


A function f on a set $S \subset \mathbb{R}$ is said to be continuous at a point $p \in S$ if
f(x) approaches its actual value f(p) when x approaches p:

if $\lim_{x\to p} f(x) = f(p)$ we say f is continuous at p.

In case p is an isolated point of S we cannot work with $\lim_{x\to p} f(x)$, but
surely there is no reason to view f as being not continuous at such a point.
So we also say that f is continuous at p if p is an isolated point of S.

Here is a cleaner definition of continuity at p:

Definition 9.1.1 A function f defined on a set $S \subset \mathbb{R}$ is said to be continuous
at a point $p \in S$ if for every neighborhood W of f(p) there is a
neighborhood U of p such that

\[
f(x) \in W \quad\text{for all } x \in U \cap S.
\]


9.2 Discontinuities 


Sometimes a function is discontinuous (that is, not continuous) at a point p
because the value f(p) is, for whatever reason, not equal to $\lim_{x\to p} f(x)$ even
though this limit exists. For example, for the function g given by

\[
g(x) = \begin{cases} \dfrac{x^2 - 9}{x - 3} & \text{if } x \neq 3;\\[4pt] 4 & \text{if } x = 3, \end{cases}
\tag{9.1}
\]

we have the limit

\[
\lim_{x\to 3} g(x) = \lim_{x\to 3}(x + 3) = 6,
\]

which is not equal to the value g(3). This type of discontinuity is removable,
simply by changing the value of g at 3.
On the other hand there are more serious discontinuities. For example,

\[
\lim_{x\to 0+}\frac{|x|}{x} = \lim_{x\to 0+}\frac{x}{x} = 1,
\]

whereas, approaching 0 from the left,

\[
\lim_{x\to 0-}\frac{|x|}{x} = \lim_{x\to 0-}\frac{-x}{x} = -1.
\]

There is a jump from left to right, and there is no way to remove this
discontinuity.
The function

\[
f(x) = \begin{cases} \sin(1/x) & \text{for all } x \neq 0;\\ 0 & \text{if } x = 0, \end{cases}
\]

has a more severe discontinuity at 0 because $\sin(1/x)$ doesn’t even have a
limit as $x \to 0$.


9.3. Continuous functions 


A function f is said to be continuous if f is continuous at every point where 
it is defined. 
All polynomial functions, such as 


Bae? ss Bape a a Bae A 


are continuous. 
Here is a simple observation that is used often without mention:

Proposition 9.3.1 If f is continuous at every point of a set S and if T is
a nonempty subset of S then f is also continuous at every point of T.

You can check this easily by consulting the definition of what it means
to be continuous at a point.

If f is a function and T is a set contained inside the domain of definition
of f then f|T, called the restriction of f to T, denotes the function whose
domain is T and whose value at any $x \in T$ is f(x).

For example, consider the function $1_{\mathbb{Q}}$ on $\mathbb{R}$ whose value is 1 on rationals
and 0 on irrationals. The restriction

\[
1_{\mathbb{Q}}|\mathbb{Q}
\]

is the function defined on $\mathbb{Q}$ whose value at every point in $\mathbb{Q}$ is 1: in other
words $1_{\mathbb{Q}}|\mathbb{Q}$ is just the constant function 1 on the set $\mathbb{Q}$.

The statement ‘f is continuous on a set T’ can have two different meanings:

(i) f is continuous at every point of T;
(ii) the restriction f|T is continuous.

For example $1_{\mathbb{Q}}|\mathbb{Q}$ is certainly continuous but $1_{\mathbb{Q}}$ is not continuous at any
point of $\mathbb{Q}$ (or at any point at all).


9.4 Two examples using Q 


The function

\[
1_{\mathbb{Q}}
\]

has the property that $\lim_{x\to p} 1_{\mathbb{Q}}(x)$ does not exist for any p. Hence this
function is discontinuous everywhere on $\mathbb{R}$.
We can damp out the discontinuity at 0 as follows: the function f given by

\[
f(x) = x\,1_{\mathbb{Q}}(x) \quad\text{for all } x \in \mathbb{R}
\]

has the property that

\[
\lim_{x\to 0} f(x) = 0 = f(0),
\]

but $\lim_{x\to p} f(x)$ does not exist for any $p \neq 0$. Hence f is continuous at exactly
one point, that being 0.
To produce a function continuous at only the points 1 and 5 we take $1_{\mathbb{Q}}(x)$
and multiply it with a function that is 0 at exactly 1 and 5; for example,

\[
x \mapsto (x - 1)(x - 5)\,1_{\mathbb{Q}}(x)
\]

is continuous at 1 and at 5 but nowhere else.

Can you manufacture a function that is continuous at exactly a given
set of points and nowhere else? Is there a function that is continuous at every
point of (0, 1) but at no other point?


9.5 Composites of continuous functions 


Proposition 9.5.1 Suppose f and g are functions defined on subsets of $\mathbb{R}$,
and suppose $f \circ g$ is defined on a neighborhood of some $p \in \mathbb{R}$. If g is
continuous at p and f is continuous at g(p) then $f \circ g$ is continuous at p.

Proof. Let W be a neighborhood of L = f(q), where q = g(p). Then, by
continuity of f at q, there is a neighborhood V of q such that $f(v) \in W$
for all $v \in V$ in the domain of f. Next, by continuity of g at p, there is a
neighborhood U of p such that $g(x) \in V$ for all $x \in U$ in the domain of g.
Hence if $x \in U$ is in the domain of $f \circ g$ then $f(g(x)) \in W$. This proves that
$f \circ g$ is continuous at p. QED


9.6 Continuity on R* 


In calculus we work with functions defined on subsets of $\mathbb{R}$ and having values
in $\mathbb{R}$. However, occasionally, it is useful to allow infinite values as well. No
great additional work is needed for this; the definition remains exactly as
before:

a function $F : S \to \mathbb{R}^*$ is continuous at $p \in S$ if either p is an isolated
point of S or if $\lim_{x\to p} F(x) = F(p)$.

An equivalent alternative form is again just as before: F is continuous
at p if for any neighborhood W of F(p) there is a neighborhood U of p such
that $F(x) \in W$ for all $x \in U \cap S$.

If $f : (a, b) \to \mathbb{R}$ is continuous and $\lim_{x\to a} f(x)$ exists then we can extend
f to a continuous function $F : [a, b) \to \mathbb{R}^*$ by setting

\[
F(a) = \lim_{x\to a} f(x).
\]

Of course, the same applies for the other endpoint b.
Chapter 10


The Intermediate Value
Theorem


Consider the function f given by

\[
f(x) = x^3 - x^2 - 2x + 1 \quad\text{for all } x \in \mathbb{R}.
\]

Figure 10.1 is a sketch of its graph for $x \in [-1.5, 2]$.

Figure 10.1: Graph of $y = x^3 - x^2 - 2x + 1$

We can check easily that

\[
f(-1) = 1 \quad\text{and}\quad f(1) = -1.
\]

Intuitively it is clear, from the continuous nature of the graph of y = f(x),
that there must be a point $p \in [-1, 1]$ where f(p) is 0. The intermediate
value theorem, which we study in this chapter, guarantees the existence of
such a point; thus, from this theorem it follows that there is a solution of the
equation

\[
x^3 - x^2 - 2x + 1 = 0
\]

on the interval [-1, 1].
The essence of the theorem remains valid in settings far beyond the real line, but even this
first glimpse of the idea, on $\mathbb{R}$, is of great use.


10.1 Inequalities from limits 


Suppose we know that a function f has the limit

\[
\lim_{x\to 5} f(x) = 9.
\]

Then f(x) is close to 9, say within a distance of 1, if x is close enough to 5
(but not 5); thus there must be a neighborhood U of 5 for which

\[
8 < f(x) \quad\text{and}\quad f(x) < 10,
\]

for all $x \in U$ with $x \neq 5$. We summarize this idea in:

Proposition 10.1.1 Let f be a function on a set S, and $p \in \mathbb{R}^*$ a point
for which $\lim_{x\to p} f(x)$ exists. If K is less than this limit then there is a
neighborhood U of p on which K < f(x) for all $x \in U$, $x \neq p$; thus,

if $K < \lim_{x\to p} f(x)$ then K < f(x) for all $x \in U$, $x \neq p$ (10.1)

for some neighborhood U of p. Similarly,

if $M > \lim_{x\to p} f(x)$ then M > f(x) for all $x \in U$, $x \neq p$ (10.2)

for some neighborhood U of p.

The proof proceeds a little bit differently from the intuition if we use our
sup-inf definition of limit:
Proof. Let

\[
L = \lim_{x\to p} f(x).
\]

This means that L is the unique value satisfying

\[
\inf_{x\in U,\,x\neq p} f(x) \le L \le \sup_{x\in U,\,x\neq p} f(x)
\]

for all neighborhoods U of p. So if K < L then K does not lie between all
such infs and sups; thus, there is some neighborhood U of p such that K
does not lie between $\inf_{x\in U,\,x\neq p} f(x)$ and $\sup_{x\in U,\,x\neq p} f(x)$. Since K < L, and L is
at most this sup, the only possibility left is

\[
K < \inf_{x\in U,\,x\neq p} f(x).
\]

This proves (10.1).
The result (10.2) for $M > \lim_{x\to p} f(x)$ follows by a similar argument.
QED


10.2. Intermediate Value Theorem 


The completeness property of the real line has one big consequence for
continuous functions: if f is continuous on the interval [a, b] then f(x) runs
through all the values between f(a) and f(b) as x runs over [a, b]:

Theorem 10.2.1 Let f be a continuous function on [a, b], where $a, b \in \mathbb{R}$
with a < b. Let t be any real number between f(a) and f(b):

\[
f(a) \le t \le f(b) \quad\text{or}\quad f(b) \le t \le f(a).
\]

Then there is a point $s \in [a, b]$ for which f(s) = t.

Proof. If t happens to be equal to f(a) then we are done; just take s = a.
Similarly if t = f(b).

Suppose then that t is neither f(a) nor f(b), and so lies strictly between
them. If f(a) < f(b) this means that f(a) < t < f(b), whereas if f(a) > f(b)
then f(b) < t < f(a).

Suppose

\[
f(a) < t < f(b);
\]

the other case can be handled in the same way, or by applying the argument to -f and -t.

Let S be the set of all $x \in [a, b]$ for which $f(x) \le t$:

\[
S = \{x \in [a, b] : f(x) \le t\}.
\]

For instance, $a \in S$. Moreover, b is an upper bound for S, because S is inside
[a, b].

Then by the completeness property for $\mathbb{R}$ there is a least upper bound
s = sup S, and this, of course, also lies in [a, b]. In fact a < s < b, because of
Proposition 10.1.1. We claim that f(s) equals t.
Consider any neighborhood U of s of the form

\[
U = (s - \delta, s + \delta),
\]

where $\delta$ is any positive real number. Since s is an upper bound of S, any
point p strictly to the right of s (that is, p > s) is not in S, and so

\[
f(p) > t,
\]

for such $p \in [a, b]$. Then

\[
\sup_{x\in U,\,x\neq s} f(x) \ge t.
\]

Since s is the least upper bound of S, any point $p \in U$ for which p < s is
not an upper bound of S and so there is some $q \in S$ with q > p. Of course
$q \le s$, since s is an upper bound of S. Hence q, lying between p and s, is in
the neighborhood U. Since $q \in S$ we have

\[
f(q) \le t.
\]

This shows that the inf of f over U, even excluding the point s, is $\le t$:

\[
\inf_{x\in U,\,x\neq s} f(x) \le t.
\]

Thus t satisfies:

\[
\inf_{x\in U,\,x\neq s} f(x) \le t \le \sup_{x\in U,\,x\neq s} f(x)
\]

for every neighborhood U of s. Since f is given to be continuous at s we
know that

\[
f(s) = \lim_{x\to s} f(x),
\]

and this limit is the unique value lying between all such infs and sups.
Hence t must be f(s). QED


10.3. Intermediate Value Theorem: a second 
formulation 


Here is another formulation of the intermediate value theorem:
Theorem 10.3.1 If f is continuous on an interval J then the image

\[
f(J) \stackrel{\text{def}}{=} \{f(x) : x \in J\}
\]

is also an interval.
Proof. To prove that f(J) is an interval we need only to show that all the
numbers between any two distinct values $y_1, y_2 \in f(J)$ also lie in f(J). Thus
consider a point t satisfying

\[
y_1 < t < y_2.
\]

Since $y_1 \in f(J)$ we have

\[
y_1 = f(a),
\]

for some $a \in J$, and since $y_2 \in f(J)$ we have

\[
y_2 = f(b)
\]

for some $b \in J$. Thus, f is continuous on [a, b] (or on [b, a], if b < a) and t lies between f(a) and
f(b). Then by Theorem 10.2.1 there is a point s between a and b for which

\[
f(s) = t.
\]

Since s lies between a and b, and a and b are points of the interval J, it follows that s also
lies in J. Thus any point t between $y_1$ and $y_2$ is of the form t = f(s), with
$s \in J$, which just means that $t \in f(J)$. Thus, f(J) is indeed an interval. QED


10.4 Intermediate Value Theorem: an appli- 
cation 


The number

\[
7^{3/4}
\]

is the positive real number whose 4-th power is $7^3$:

\[
(7^{3/4})^4 = 7^3.
\]

But how do we know that such a real number exists? We can obtain existence
by using the intermediate value theorem.
Consider the function

\[
q(x) = x^4 \quad\text{for all } x \in \mathbb{R}.
\]

This is clearly continuous, and from

\[
q(0) = 0 \quad\text{and}\quad q(7) = 7^4
\]

we see that the number $7^3$ lies between these extremes:

\[
q(0) < 7^3 < q(7).
\]

Hence, by the intermediate value theorem, there is a real number $s \in (0, 7)$
for which

\[
s^4 = 7^3.
\]

Could there be another positive real number $s_1$ whose 4-th power is also $7^3$?
The answer is no, because if $s > s_1 > 0$ then $s_1^4$ is $< s^4 = 7^3$, whereas if
$s < s_1$ then $s_1^4 > s^4 = 7^3$. Thus there is a unique positive real number whose
4-th power is $7^3$. This number is denoted

\[
7^{3/4}.
\]

In this way one can see that

\[
x^y
\]

exists for all positive real x and all rational y.
Returning to $7^{3/4}$ we can extract some more information: we saw that s
actually lies between 0 and 7. But we can sharpen this much further. Since
$7^3 = 343$ we have

\[
q(4) = 4^4 = 256 < 7^3 < q(5) = 5^4 = 625.
\]

Hence $7^{3/4}$ actually lies between 4 and 5. With more work we can narrow
down the location of $7^{3/4}$ systematically.
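One systematic way to narrow down the location, sketched here in Python under the assumption that we simply keep halving the interval (a technique usually called bisection, which the text itself does not name), is to test the midpoint each time and keep the half on which q still straddles the value 7^3. The function name `bisect_fourth_root` is an illustrative choice.

```python
def q(x):
    return x**4

def bisect_fourth_root(target, lo, hi, steps=50):
    # assumes q(lo) < target < q(hi); halves the interval at each step,
    # keeping the half on which q still straddles the target
    for _ in range(steps):
        mid = (lo + hi) / 2
        if q(mid) < target:
            lo = mid
        else:
            hi = mid
    return lo, hi

lo, hi = bisect_fourth_root(7**3, 4.0, 5.0)
print(lo, hi)
```

Each step is an application of the intermediate value theorem to a smaller interval, so the printed endpoints trap the number whose 4-th power is 343.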

It is clear that not much is special about the numbers 7 and 3/4 in this
discussion. The intermediate value theorem (and, more fundamentally, the
completeness of $\mathbb{R}$) shows that for any real number x > 0 and any rational
number r = p/q, with $p, q \in \mathbb{Z}$ and $q \neq 0$, there is a unique non-negative real
number $x^r$ which satisfies

\[
(x^{p/q})^q = x^p.
\]

10.5 Locating roots 


Consider the equation 
a’ —32+1=0. 


There is no systematic way to work out exact solutions of equations such as 
this. However, there are many ways of determining information about the 
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solutions as well as finding very good approximations to them. Here let us 
see how the intermediate value theorem shows that there are solutions of the 
equation and helps localize them somewhat. 

Consider the function

$$ f(x) = x^7 - 3x + 1 \quad\text{for all } x \in \mathbb{R}. $$
This is clearly continuous. Let us check a few values of f:
$$ f(-2) = -121, \quad f(-1) = 3, \quad f(0) = 1, \quad f(1) = -1, \quad f(2) = 123. $$
Since f is continuous and 0 lies between $f(-2)$ and $f(-1)$:
$$ f(-2) = -121 < 0 < 3 = f(-1), $$
it follows by the intermediate value theorem that
there is a point $p \in (-2, -1)$ where $f(p) = 0$.
This means that the equation
$$ x^7 - 3x + 1 = 0 $$ (10.3)

has a solution in the interval $(-2, -1)$.

By the same reasoning we see that the equation (10.3) also has a solution
in $(0, 1)$ and a solution lying in $(1, 2)$.

One way of pinning down the location of the solutions of (10.3) would
be to divide, say, the interval (1, 2) into ten pieces, each of width .1, and
check the values

$$ f(1),\ f(1.1),\ f(1.2),\ \ldots,\ f(1.9),\ f(2), $$
to see where f changes from negative to positive. We can calculate
$$ f(1.1) \approx -0.35, \qquad f(1.2) \approx 0.98, $$


and this tells us there is a root (which means the same as ‘solution’) of the 
equation (10.3) in the interval (1.1, 1.2). Next, repeating the same strategy 
by dividing (1.1, 1.2) into ten pieces and calculating the values 


$$ f(1.11),\ f(1.12),\ f(1.13),\ f(1.14),\ f(1.15),\ \ldots,\ f(1.19),\ f(1.2), $$
we observe that
$$ f(1.13) \approx -0.037 \quad\text{and}\quad f(1.14) \approx 0.082. $$
Thus, there is a root in the interval 
(1.13, 1.14). 


This is a slow and inefficient process, but a first process nonetheless to sys- 
tematically pin down a root of an equation. Later, with the use of calculus, 
we can study much faster methods for locating roots. 
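The ten-piece subdivision described above can be sketched in Python as follows. This is a minimal illustration, assuming the function is $f(x) = x^7 - 3x + 1$ as in equation (10.3); the helper name `refine` is hypothetical.

```python
def f(x):
    # The function of equation (10.3), as read from the text.
    return x ** 7 - 3 * x + 1

def refine(f, a, b, pieces=10):
    """Return the first of `pieces` equal subintervals of [a, b] on which f changes sign."""
    width = (b - a) / pieces
    for i in range(pieces):
        left = a + i * width
        right = a + (i + 1) * width
        if f(left) * f(right) <= 0:   # sign change, or a root at an endpoint
            return left, right
    raise ValueError("no sign change found on the subdivision")

a, b = 1.0, 2.0
for _ in range(4):
    a, b = refine(f, a, b)
    print(round(a, 4), round(b, 4))   # each pass gains one more decimal place
```

Each pass costs up to eleven evaluations of f for one decimal digit, which is the slowness the text mentions.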


Chapter 11 


Inverse Functions 


In this chapter we show, by using the intermediate value theorem, that
equations of the form
$$ y = f(x) $$
can be ‘solved’ for a good class of continuous functions f. The solution is
then displayed as
$$ x = f^{-1}(y), $$

and $f^{-1}$ is called the inverse of the function f.


11.1 Inverse trigonometric functions 


The graph of y = sin x oscillates between —1 and 1. 


Figure 11.1: Graph of sin. 


If we try to solve an equation such as 


$$ \sin x = .5 $$
there are infinitely many values for $x$:

$$ \frac{\pi}{6},\ \frac{5\pi}{6},\ \frac{\pi}{6} \pm 2\pi,\ \frac{5\pi}{6} \pm 2\pi,\ \frac{\pi}{6} \pm 4\pi,\ \frac{5\pi}{6} \pm 4\pi,\ \ldots $$


Each of these could be thought of as an ‘inverse sin’ for the value .5 in
the sense that the sin of each of these is .5. However, to avoid ambiguity we
can focus on just the value $\pi/6$: what makes it unique is that it is the only
value between $-\pi/2$ and $\pi/2$ whose sin is .5.


Figure 11.2: Graph of $y = \sin x$ for $x \in [-\pi/2, \pi/2]$.
We define arcsin(.5) to be $\pi/6$:
$$ \arcsin(.5) \overset{\text{def}}{=} \sin^{-1}(.5) = \pi/6. $$


More generally 


$\sin^{-1}(w)$ is the unique value in $[-\pi/2, \pi/2]$ whose sin is $w$, (11.1)
that is,
$\sin(\sin^{-1} w) = w$ and $\sin^{-1} w \in [-\pi/2, \pi/2]$. (11.2)
Thus,
$y = \sin^{-1} x$ means that $y \in [-\pi/2, \pi/2]$ and $\sin y$ is $x$. (11.3)


Since the values of sin always lie between $-1$ and 1, there is no value whose
sin is 2; thus


$\sin^{-1} x$ is not defined, as a real number, if x is not in $[-1, 1]$.


On the positive side, 
Proposition 11.1.1 If $A \in [-1, 1]$ then there exists a unique $B \in [-\pi/2, \pi/2]$
for which $\sin B = A$.


Proof. The function sin is continuous on $[-\pi/2, \pi/2]$ and the endpoint values
are
$$ \sin(-\pi/2) = -1 \quad\text{and}\quad \sin(\pi/2) = 1. $$
Therefore, by the intermediate value theorem, for any $A \in [-1, 1]$ there is a
$B \in [-\pi/2, \pi/2]$ for which
$$ A = \sin B. $$

To see that B is unique simply observe that the function $y = \sin x$ is strictly
increasing on $[-\pi/2, \pi/2]$, so two different values of x could not have the
same value for $y = \sin x$. QED
Thus, 


$\sin^{-1} x$ is defined for all $x \in [-1, 1]$.


We can run through the same arguments, with minor changes, for the
function cos. We have to be careful to observe that $y = \cos x$ is not strictly
increasing on $[-\pi/2, \pi/2]$. For example both $\pi/3$ and $-\pi/3$ have cos equal
to .5:

$$ \cos(-\pi/3) = \cos(\pi/3) = .5. $$


Figure 11.3: Graph of cos. 


One possibility is to work with the interval $[-\pi, 0]$ on which cos is strictly
increasing, but there is a bias against using negative values when positives
would work. So, instead we use the interval

$$ [0, \pi] $$
on which cos is strictly decreasing.
Running through the argument we conclude that

for every $A \in [-1, 1]$ there is a unique $B \in [0, \pi]$ for which $\cos B = A$.
The unique value y in $[0, \pi]$ for which $\cos y$ is x is denoted $\cos^{-1} x$:
$\cos^{-1}(x)$ is the unique value in $[0, \pi]$ whose cos is $x$, (11.4)
or, equivalently,

$y = \cos^{-1} x$ means that $y \in [0, \pi]$ and $\cos y$ is $x$. (11.5)


We can run the same reasoning for tan as well and see that

for every $A \in \mathbb{R}$ there is a unique $B \in (-\pi/2, \pi/2)$ for which $\tan B = A$.

$\tan^{-1}(x)$ is the unique value in $(-\pi/2, \pi/2)$ whose tan is $x$, (11.6)

or, equivalently,

$y = \tan^{-1} x$ means that $y \in (-\pi/2, \pi/2)$ and $\tan y$ is $x$. (11.7)
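These conventions match the inverse trigonometric functions in Python's standard `math` module, which gives a quick way to experiment with them. The following is only an illustrative sketch; the particular sample values are chosen for the example.

```python
import math

# Each inverse function returns the value in the interval singled out in the text.
print(math.asin(0.5), math.pi / 6)   # both about 0.5236
print(math.acos(0.5), math.pi / 3)   # both about 1.0472
print(math.atan(1.0), math.pi / 4)   # both about 0.7854

# 5*pi/6 also has sine .5, but arcsin picks the value in [-pi/2, pi/2].
x = 5 * math.pi / 6
print(math.sin(x), math.asin(math.sin(x)))

# There is no real number whose sine is 2.
try:
    math.asin(2.0)
except ValueError as err:
    print("asin(2) is not defined as a real number:", err)
```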


11.2 Monotone functions: terminology 


We say that a function f is increasing if

$$ f(s) \leq f(t) $$

for all s, t in the domain of f for which $s < t$. We say that f is strictly
increasing if
$$ f(s) < f(t) $$
for all s, t in the domain of f for which $s < t$.
A function f is decreasing if

$$ f(s) \geq f(t) $$

for all s, t in the domain of f for which $s < t$. We say that f is strictly
decreasing if

$$ f(s) > f(t) $$
for all s, t in the domain of f for which $s < t$.

Clearly, a function f is (strictly) increasing if and only if —f is (strictly) 
decreasing. For this reason we will often state or prove results just for in- 
creasing functions, it being generally understood from context that the cor- 
responding result for decreasing functions also holds. 

A monotone function is a function that is increasing or that is decreasing.
We say f is strictly monotone if f is strictly increasing or strictly decreasing.

The function 

oi Saas eat (11.8) 


is an increasing function that is not strictly increasing. 


11.3 Inverse functions 


Here is a somewhat strange result, guaranteeing continuity of certain types 
of strictly monotone functions: 


Proposition 11.3.1 If g is a strictly monotone function defined on a set
$S \subset \mathbb{R}$ such that the range g(S) is an interval then g is continuous.


The ideas developed for arcsin and arccos can be summarized in a general 
way: 


Proposition 11.3.2 If f is a continuous strictly monotone function on an
interval $U \subset \mathbb{R}$ then the range V of f is also an interval, and there is a
unique function $f^{-1}$ defined on V such that

$$ f^{-1}(f(x)) = x \quad\text{for all } x \in U. $$ (11.9)

This inverse function $f^{-1}$ is continuous.
The inverse function also satisfies

$$ f(f^{-1}(y)) = y \quad\text{for all } y \in V. $$ (11.10)
For example, the inverse of the function
$$ [0, \infty) \to [0, \infty) : x \mapsto x^2 $$

is the function
$$ [0, \infty) \to [0, \infty) : A \mapsto \sqrt{A}. $$
Proof. We work with the case when f is strictly increasing; the case of strictly
decreasing f is settled in an exactly similar way (or by applying the result for
strictly increasing functions to $-f$ in place of f).

To show that V is an interval we have to show that if $c, d \in V$, with $c < d$,
then every point between c and d is also in the range V of f. For $c, d \in V$
we have

$$ c = f(a) \quad\text{and}\quad d = f(b), $$

for some $a, b \in U$. Now consider a point q strictly between c and d:
$$ c < q < d. $$

This means
$$ f(a) < q < f(b). $$
By the intermediate value theorem (keep in mind f is continuous) there is a
point $p \in (a, b)$ for which
$$ q = f(p). $$
This means $q \in V$, the range of f. Thus V is an interval.

Now define $f^{-1}$ on V as follows. If $y \in V$ then there is a point $x \in U$
with $f(x) = y$. There cannot be any other point in U whose image under f
is also y, because f is strictly increasing (points strictly below/above x are
mapped by f to points strictly below/above y). Set

$$ f^{-1}(y) = x \quad\text{if } f(x) = y. $$

This proves (11.9), by simply writing in the value f(x) in place of y in

$$ f^{-1}(y) = x. $$
Applying f to both sides of $f^{-1}(y) = x$ shows that

$$ f(f^{-1}(y)) = f(x) = y, $$
which proves (11.10).

Continuity of $f^{-1}$ follows on applying Proposition 11.3.1 to the function
$g = f^{-1}$, whose range is the interval U. QED
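The existence argument in the proof suggests a way to approximate values of an inverse function numerically: for a continuous, strictly increasing f, repeatedly halve an interval on which f crosses the value y. The sketch below uses bisection as a stand-in for the intermediate value theorem; the name `inverse_increasing` and its parameters are hypothetical.

```python
def inverse_increasing(f, a, b, y, steps=60):
    """Approximate the x in [a, b] with f(x) == y.

    Assumes f is continuous and strictly increasing on [a, b].
    """
    if not (f(a) <= y <= f(b)):
        raise ValueError("y is outside the range of f on [a, b]")
    lo, hi = a, b
    for _ in range(steps):
        mid = (lo + hi) / 2
        if f(mid) < y:
            lo = mid   # strict monotonicity: the solution is to the right of mid
        else:
            hi = mid   # the solution is at or to the left of mid
    return (lo + hi) / 2

def square(x):
    return x * x

x = inverse_increasing(square, 0.0, 10.0, 2.0)
print(x, square(x))   # x is close to sqrt(2), and square(x) is close to 2
```

Strict monotonicity is what makes the answer unique: without it, the interval could contain several solutions and the halving step would have no single target.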

Suppose f is a strictly increasing function defined on a set S. Then f(S) 
can be thought of as S with the points p renamed as f(p), and the ordering 
of the points is preserved: 


f(s) < f(t) if and only if s<t. 
Thus, f(S) contains a largest element if and only if S contains a largest
element, and f(S) contains a smallest element if and only if S contains a 
smallest element. This gives us: 


Proposition 11.3.3 If f is a continuous, strictly increasing function, then
for any interval J in the domain of f, the image f(J) is of the same type as
J; specifically,

(i) if $J = [a, b]$ then $f(J) = [f(a), f(b)]$;

(ii) if $J = (a, b]$ then $f(J) = (c, d]$, where $c = \inf f(J)$ and $d = \sup f(J) = f(b)$;

(iii) if $J = [a, b)$ then $f(J) = [c, d)$, where $c = \inf f(J) = f(a)$ and $d = \sup f(J)$;

(iv) if $J = (a, b)$ then $f(J) = (c, d)$, where $c = \inf f(J)$ and $d = \sup f(J)$.
Chapter 12


Maxima and Minima 


A fundamental feature of continuous functions is that they attain maximum
and minimum values on certain types of sets such as closed intervals $[a, b]$,
for $a, b \in \mathbb{R}$ with $a < b$.


12.1 Maxima and Minima 


The completeness property of the real line has another big consequence for
continuous functions: if f is continuous on the interval $[a, b]$ then f actually
attains a maximum value and a minimum value at points of the interval $[a, b]$.


Theorem 12.1.1 Let f be a continuous function on $[a, b]$, where $a, b \in \mathbb{R}$
with $a < b$. Then there exist $c, d \in [a, b]$ such that

$$ f(c) = \inf_{x \in [a,b]} f(x), \qquad f(d) = \sup_{x \in [a,b]} f(x). $$ (12.1)


Before proceeding to logical reasoning, here is our strategy for finding a
point where f reaches the value

$$ M = \sup_{x \in [a,b]} f(x). $$

Let us follow a point t, starting at a and moving to the right towards b, and
keep track of the ‘running supremum’

$$ S_f(t) = \sup_{x \in [a,t]} f(x). $$
If f(a) itself is already the maximum M then we are done; assuming then
that $f(a) < M$, surely the ‘first exit time’ t when $S_f(t)$ escapes from below
the value M is where f actually takes the value M. Thus our guess for d is

$$ d_* = \sup B_M, $$ (12.2)
where $B_M$ is the set of all t for which $S_f(t)$ is below M:
$$ B_M = \{t \in [a, b] : S_f(t) < M\}. $$ (12.3)

(We are assuming the initial value f(a) isn’t already M.) It is useful to have
in mind a graph of $S_f$: it increases (possibly remaining constant on stretches
of values of t) and once it hits the value M it stays there all the way to $t = b$.

It is intuitively clear that for t to the left of $d_*$ the value $S_f(t)$ is $< M$,
whereas for any t to the right of $d_*$ the value of $S_f(t)$ is M; this would imply
that the supremum of f on any neighborhood of $d_*$ is in fact M. Then,
from the definition of the limit $\lim_{x \to d_*} f(x)$ as the unique value between suprema
and infima of f over neighborhoods of $d_*$, we would have $\lim_{x \to d_*} f(x) = M$;
continuity of f at $d_*$ would then imply $f(d_*) = M$.

Observe that if $S_f(x) < M$ then x is in $B_M$ and so, since $d_*$ is an upper bound
of $B_M$, it follows that $d_* \geq x$. Thus, no point to the right of $d_*$ has $S_f$-value $< M$.
Hence:

$$ S_f(x) = M \quad\text{for all } x > d_*. $$ (12.4)


Proof of Theorem 12.1.1. We show only the existence of a point d where f
attains its maximum value. The argument for the minimum is exactly similar
(or we can use the trick of applying the maximum result to $-f$ in place of f
to find where f is minimum).

Let us go through the remaining argument slowly, breaking it up into
pieces.

If f(a) happens to be equal to M then, of course, we are done, on taking
d to be a. Suppose from now on that $f(a) < M$. This implies, in particular,
that the set $B_M$ is not empty, containing at least the point a.

Consider now any neighborhood U of $d_*$. Choose any $r \in U \cap [a, b]$ with
$r > d_*$; what if $d_* = b$? We will deal with that case later, assuming for
now that $d_* < b$. Then, as already noted in (12.4), $S_f(r) = M$.
Consequently,

$$ \sup_{x \in U \cap [a,b]} f(x) = M. $$
Hence M satisfies

$$ \inf_{x \in U \cap [a,b]} f(x) \leq M \leq \sup_{x \in U \cap [a,b]} f(x), $$

with the second $\leq$ being actually an equality. This is true for any neighborhood
U of $d_*$. Therefore, by our definition of limit,

$$ \lim_{x \to d_*} f(x) = M. $$

But f is continuous at $d_*$. Hence

$$ f(d_*) = M, $$

and we are done.
Lastly suppose $d_* = b$. Then taking any $q \in [a, b]$ with $q < b$, we know that
q is not an upper bound of $B_M$ (for $d_* = b$ is the least upper bound of $B_M$). So
there is a $p > q$ in $[a, b]$ which is in $B_M$, and this means $\sup_{x \in [a,p]} f(x) < M$.
Therefore also
$$ \sup_{x \in [a,q]} f(x) < M. $$
But since $\sup_{x \in [a,b]} f(x)$ is M we must have $\sup_{x \in (q,b]} f(x) = M$. Thus the
supremum of f over every neighborhood of $d_*$ (which is b) is M. Then by the
argument used in the previous paragraph it follows again that $f(d_*) = M$.
The result for $\inf_{x \in [a,b]} f(x)$ is obtained similarly, or by just applying the
result for sup to the function $-f$ instead of f. QED
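The ‘running supremum’ strategy can be imitated on a finite grid of sample points. This is only a discrete illustration of the idea, not a substitute for the proof: the grid maximum stands in for M, and the name `first_max_point` and the test function `plateau` are assumptions made for the example.

```python
import math

def first_max_point(f, a, b, n=3000):
    """Grid analogue of d_* = sup{t : S_f(t) < M}, with S_f the running supremum."""
    xs = [a + (b - a) * i / n for i in range(n + 1)]
    M = max(f(x) for x in xs)          # grid stand-in for the sup of f on [a, b]
    running = -math.inf
    for x in xs:
        running = max(running, f(x))   # S_f(x), computed over grid points up to x
        if running >= M:               # first grid point where S_f reaches M
            return x, M
    raise AssertionError("unreachable: the grid maximum is attained on the grid")

def plateau(x):
    # Rises until x = 1 and then stays at its maximum value 1.
    return min(x, 1.0)

d, M = first_max_point(plateau, 0.0, 3.0)
print(d, M)   # the running supremum first reaches M at the start of the plateau
```

For `plateau` the maximum is attained at many points, and the procedure returns the leftmost one, matching the role of $d_*$ as the ‘first exit time’.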
The preceding heavily used result works for functions defined on closed
intervals $[a, b]$, with $a, b \in \mathbb{R}$. But what of functions defined on other types
of intervals? For example, for the function

$$ \frac{1}{x} \quad\text{for } x \in (0, \infty) $$

it is clear that the function is trying to reach its supremum $\infty$ at the left
endpoint 0 and its infimum 0 at the right endpoint $\infty$. Figure 12.1 shows
the graph of the function given on $(0, \infty)$ by $x^2 + \frac{2}{x} - 2$. The function has
sup equal to $\infty$, which is the value it is trying to reach at both endpoints 0
and $\infty$ of the interval $(0, \infty)$; the inf occurs at $x = 1$ and the corresponding
minimum value is $1^2 + \frac{2}{1} - 2 = 1$.
Figure 12.1: Graph of $x^2 + \frac{2}{x} - 2$, for $x > 0$.


Proposition 12.1.1 Suppose f is a continuous function on an interval $U \subset \mathbb{R}$,
with $a, b \in \mathbb{R}^*$ being the left and right endpoints, and suppose that both
$L_a = \lim_{x \to a} f(x)$ and $L_b = \lim_{x \to b} f(x)$ exist and are in $\mathbb{R}$ (finite).

Then either f attains a maximum value in the interior of U or $\sup_{x \in U} f(x)$
is the larger of the endpoint limits $L_a$ and $L_b$.

Moreover, either f attains its minimum value at some point in the interior
of U or $\inf_{x \in U} f(x)$ is the lesser of the two endpoint limits $L_a$ and $L_b$.


We will not work through the proof but sketch the ideas. Observe that
we can extend the function f to be defined at the endpoints a and b (if one
or both of them is not already in U) by setting $f(a) = L_a$ and $f(b) = L_b$.
Then f is defined on the interval $[a, b] \subset \mathbb{R}^*$ (denoting the left endpoint of
U by a), and f is allowed to take the values $\pm\infty$ at the endpoints a and b.
The rest of the argument follows the proof of Theorem 12.1.1.


12.2. Maxima/minima with infinities 


The arguments used to prove existence of maxima and minima work without
much change for functions defined on subsets of $\mathbb{R}^*$ and with values in $\mathbb{R}^*$:


Proposition 12.2.1 If $F : [a, b] \to \mathbb{R}^*$ is a continuous function, where $a, b \in \mathbb{R}^*$
with $a < b$, then F attains a maximum value and a minimum value on

DRAFT Calculus Notes 11/17/2011 111 


$[a, b]$. Thus, there exist points $c, d \in [a, b]$ such that

$$ F(c) = \inf_{x \in [a,b]} F(x), \qquad F(d) = \sup_{x \in [a,b]} F(x). $$ (12.5)


12.3. Closed and bounded sets 


Consider a set $K \subset \mathbb{R}$ that is closed and that lies inside an interval $[a, b]$,
where $a, b \in \mathbb{R}$ and $a < b$. Such a set is closed and bounded.


Theorem 12.3.1 If $f : K \to \mathbb{R}$ is a continuous function on a nonempty
closed and bounded set K then f attains a maximum value and a minimum
value on this set K. Thus, there exist points $c, d \in K$ such that

$$ f(c) = \inf_{x \in K} f(x), \qquad f(d) = \sup_{x \in K} f(x). $$ (12.6)
Chapter 13


Tangents, Slopes and 
Derivatives 


The geometric notion of tangent is most easily understood for circles. A line
is tangent to a circle at a point if that is the only point where the line and
the circle meet. Another definition uses more geometry: a line through a
point P of a circle with center C is tangent to the circle if it is perpendicular
to the radius CP.

Both of these ideas are illuminating and reflect our intuition of what a
tangent line ought to be. However, neither notion works very well for other
curves. For example, visually it is perfectly clear that the line $y = 1$ is
tangent to the graph $y = \sin x$, yet it meets this graph at infinitely many
points. On the other hand any ‘vertical’ line meets $y = \sin x$ at just one
point and yet such a line is surely not tangent to the graph. Since the graph
$y = \sin x$ has no natural notion of ‘center’, it is also useless to try to define
a tangent as a line perpendicular to a ‘radius.’

A geometrically elegant formulation of the notion of tangent arises naturally
for the case of ellipses. Think of a circle C, and a tangent line l, at a
point P, to the circle, drawn on a transparent sheet of paper. When a light
is shone on the sheet from an angle, from a flashlight, the shadow C' cast
on a wall by the circle C is a stretched out version of the circle. This curve
C' is called an ellipse. The shadow l' cast by the tangent line l is again a
straight line and is surely the tangent line to the ellipse C' at the point P',
which is the shadow of P. This notion goes back to the Greek study of conic
sections.

Elegant though it is, even the method of the preceding paragraph fails
to provide a definition of tangent that works for more general curves. For a
general curve C we need to view a tangent line at a point P as a limiting
form of the line PQ (where Q is a ‘nearby’ point on C) as Q approaches P. This
is formalized in the next section.


Figure 13.1: Tangent line and secant segment. 


13.1 Secants and tangents 


Consider a function f defined on $\mathbb{R}$ and think of the graph of f:
$$ \{(x, y) : y = f(x),\ x \in \mathbb{R}\}. $$
Consider now a point
$$ P = (x_*, y_*) $$
on this graph.

A secant line is a straight line through P and any other point Q on the
graph.

Now think of all secant lines PQ, where $Q = (x, y)$ runs over points on
the graph with x lying in some neighborhood U of $x_*$. There are possibly
many such lines, each with a different slope: $(y - y_*)/(x - x_*)$.

We shall say that a line l is tangent to the graph of f at the point P if
it is the unique line that passes through P and has slope lying between the
sups and infs of slopes of all ‘nearby’ secant lines:

$$ \inf_{x \in U,\, x \neq x_*} \frac{f(x) - f(x_*)}{x - x_*} \leq \mathrm{slope}(l) \leq \sup_{x \in U,\, x \neq x_*} \frac{f(x) - f(x_*)}{x - x_*} $$ (13.1)
for all neighborhoods U of $x_*$. If f is defined only on a subset S of $\mathbb{R}$ then
we modify this definition by replacing $x \in U$ with $x \in U \cap S$. We interpret an
infinite value, $\infty$ or $-\infty$, of slope to mean that the tangent line is ‘vertical’,
parallel to the y-axis.

Another way to view the uniqueness of the tangent line is to observe that this
means that the slope of the tangent line is

$$ \text{slope of tangent at point } P = \lim_{x \to x_*} \frac{f(x) - f(x_*)}{x - x_*} $$ (13.2)

with the existence of this limit signifying the existence of a tangent line. The
tangent to the graph $y = f(x)$ at $P(x_*, f(x_*))$ is the line through P with slope
given by (13.2).

The slope of the tangent line to $y = f(x)$ at a point P is also called the
slope of the curve $y = f(x)$ at P.

For some functions there may be multiple lines l with slope satisfying the
condition (13.1). For example, for the graph of

$$ y = |x| $$
any line through the origin with slope in $[-1, 1]$ satisfies the condition (13.1).
Here is an illustration of a graph with a whole range of slopes satisfying the
condition (13.1) at the point P:


Occasionally we will consider such a ‘quasi-tangent’ line to a graph.
Though this is not a standard notion, let us agree that by a quasi-tangent
line at $P(p, f(p))$ to the graph $y = f(x)$ for a function f we mean a line l
through P of slope satisfying the bounds

$$ \inf_{w \in U \cap S,\, w \neq p} \frac{f(w) - f(p)}{w - p} \leq \text{slope of } l \leq \sup_{w \in U \cap S,\, w \neq p} \frac{f(w) - f(p)}{w - p}, $$ (13.3)
for every neighborhood U of p, where S is the domain of the function f.
(Note that, as with tangent lines, this notion is meaningful only when $p \in S$
is not an isolated point of S.)
Thus, $y = f(x)$ has a tangent line at P if and only if it has a unique
quasi-tangent line, and in this case the quasi-tangent line is the tangent line.
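The bounds in the definition can be explored numerically by sampling secant slopes over shrinking neighborhoods. The sketch below only samples finitely many points, so it suggests the inf and sup rather than computing them; the helper name `secant_slopes` is hypothetical.

```python
def secant_slopes(f, p, radius, n=1000):
    """Slopes of secants through (p, f(p)) and (w, f(w)) for sample points w with 0 < |w - p| <= radius."""
    slopes = []
    for i in range(1, n + 1):
        h = radius * i / n
        for w in (p - h, p + h):
            slopes.append((f(w) - f(p)) / (w - p))
    return slopes

def square(x):
    return x * x

for radius in (1.0, 0.1, 0.001):
    s_abs = secant_slopes(abs, 0.0, radius)
    s_sq = secant_slopes(square, 0.0, radius)
    # For |x| the range of slopes stays [-1, 1]; for x**2 it shrinks towards 0.
    print(radius, (min(s_abs), max(s_abs)), (min(s_sq), max(s_sq)))
```

For $|x|$ at 0 the sampled slopes fill out the same range however small the neighborhood, so there are many quasi-tangent lines and no tangent; for $x^2$ the range collapses to the single slope 0.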
13.2 Derivative


Consider a function f defined on a set $S \subset \mathbb{R}$ and let p be a point of S that
is not an isolated point. The derivative of f at p is defined to be:

$$ f'(p) = \lim_{x \to p} \frac{f(x) - f(p)}{x - p}. $$ (13.4)

Thus the derivative $f'(p)$ is the slope of the tangent to the graph $y = f(x)$ at
the point $(p, f(p))$. Of course, if the graph fails to have a tangent line at this
point then f fails to have a derivative at p.

Let us look at some simple examples. First consider the constant function
K whose value everywhere is 5:

$$ K(x) = 5 \quad\text{for all } x. $$

Common sense tells us that the slope of this is 0. We can check this readily
from the official definition:

$$ K'(p) = \lim_{x \to p} \frac{K(x) - K(p)}{x - p} = \lim_{x \to p} \frac{5 - 5}{x - p} = \lim_{x \to p} 0 = 0. $$

We can elevate this observation slightly by observing that we don’t need K
to be equal to 5 everywhere, but just on a neighborhood of p.

If the function f is constant near p, then $f(x) = f(p)$ for x in a neighborhood
of p, and so the derivative $f'(p)$ is 0. This just says that the graph
is flat. Thus,


If a function is constant on a neighborhood of a point p then the
derivative of the function at p is 0.


Next, consider the function

$$ K(x) = x \quad\text{for all } x \in \mathbb{R}. $$

Then for any real number p we have

$$ K'(p) = \lim_{x \to p} \frac{K(x) - K(p)}{x - p} = \lim_{x \to p} \frac{x - p}{x - p} = \lim_{x \to p} 1 = 1. $$

Hence the slope of
$$ y = x $$
is 1: surely this is geometrically utterly obvious.
We can check readily that for the function $x \mapsto Mx + C$, where M and
C are real numbers (constants), the derivative is M everywhere:

$$ \lim_{x \to p} \frac{(Mx + C) - (Mp + C)}{x - p} = \lim_{x \to p} \frac{M(x - p)}{x - p} = \lim_{x \to p} M = M. $$
13.3. Notation

The derivative of f at p is denoted
$$ f'(p). $$

This is good for theoretical proofs and such but not very useful for practical
algebraic calculations.
If a formula is given for f(x) we denote the derivative of f at x by

$$ \frac{df(x)}{dx} = f'(x). $$


The beginner’s error in this notation is to put in a value for x in $df(x)/dx$;
that is wrong usage of the notation:

$$ \frac{df(3)}{dx} \quad\text{is wrong notation.} $$
Instead we should write
$$ \frac{df(x)}{dx} \text{ at } x = 3, \quad\text{or}\quad \left.\frac{df(x)}{dx}\right|_{x=3}. $$


If we are writing $y = f(x)$ then the derivative of f at x is
$$ \frac{dy}{dx}. $$
This notation meshes well with the derivative being the limit of the ratio of
the increase in y to the increase in x:

$$ \frac{dy}{dx} = \lim_{\Delta x \to 0} \frac{\Delta y}{\Delta x}, $$ (13.5)

where
$$ \Delta y = y_Q - y_P, \qquad \Delta x = x_Q - x_P, $$
with P being the point $(x, y) = (x, f(x))$ and Q the point $(w, f(w))$, and the limit
$Q \to P$ is encoded in $\Delta x \to 0$.
For algebraic calculations the notation $f'(p)$ is inconvenient. Instead we
use the notation
$$ \frac{df(x)}{dx} $$
to denote the derivative of f at x. For example, for the function s given by

$$ s(x) = x^2 \quad\text{for all } x \in \mathbb{R} $$
we denote the derivative $s'(x)$ by

$$ \frac{dx^2}{dx}. $$

Note that we don’t mean that this is an actual ratio, nor do we mean that
somehow the ‘denominator’ is d times x. The entire expression $dx^2/dx$ should
be viewed (at least at this stage) as one object, the derivative.


13.4. The derivative of $x^2$


Let us work out the derivative of the function given by $f(x) = x^2$ for all
$x \in \mathbb{R}$ at $x = 3$. This is just the slope of the tangent line to the graph

$$ y = x^2 $$
at the point $P = (3, 3^2)$.
The slope of the secant line PQ, where $Q = (x, x^2)$, is

$$ \text{slope } PQ = \frac{x^2 - 3^2}{x - 3}. $$

To find the slope of the tangent we need to let Q approach P; this means
$x \to 3$, and we are looking then at the limit

$$ \lim_{x \to 3} \frac{x^2 - 3^2}{x - 3}. $$

We can work this out easily. (Warning: avoid the 0/0 trap!) We factor $x^2 - 3^2$
as a product
$$ x^2 - 3^2 = (x - 3)(x + 3) $$
Figure 13.2: A secant segment to $y = x^2$ at $P(3, 3^2)$.


and obtain
$$ \lim_{x \to 3} \frac{x^2 - 3^2}{x - 3} = \lim_{x \to 3} \frac{(x - 3)(x + 3)}{x - 3} = \lim_{x \to 3} (x + 3) = 6. $$ (13.6)

Thus the slope of the tangent to $y = x^2$ at the point $(3, 3^2)$ is 6.
If you trace through the calculations above for a general point $(p, p^2)$ on
$y = x^2$ you see that the slope of the tangent at $(p, p^2)$ is $2p$:

$$ \lim_{x \to p} \frac{x^2 - p^2}{x - p} = \lim_{x \to p} \frac{(x - p)(x + p)}{x - p} = \lim_{x \to p} (x + p) = 2p. $$ (13.7)

Thus the derivative of the function given by $f(x) = x^2$ at $x = p$ is $2p$.
Using the notation $df(x)/dx$ this is displayed as

$$ \frac{dx^2}{dx} = 2x. $$ (13.8)
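The limiting process can be watched numerically by evaluating secant slopes for points Q closer and closer to P. A small sketch follows; the helper name `secant_slope` is an assumption made for the example.

```python
def secant_slope(f, p, x):
    """Slope of the secant through (p, f(p)) and (x, f(x)), for x != p."""
    return (f(x) - f(p)) / (x - p)

def square(x):
    return x * x

p = 3.0
for x in (4.0, 3.5, 3.1, 3.01, 3.001):
    # For f(x) = x**2 the secant slope equals x + p, which tends to 2 * p = 6.
    print(x, secant_slope(square, p, x))
```

Note that the code never evaluates the quotient at $x = p$ itself, which is the ‘0/0 trap’ the text warns about.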
Figure 13.3: Secant segment for $y = x^3$ at $P(x, x^3)$.


13.5 Derivative of $x^3$


Let us do the calculation of the derivative for the function $f(x) = x^3$. Following
the method used for $y = x^2$ we have first the picture in Figure 13.3.

We can see that

$$ \text{slope of } PQ = \frac{w^3 - x^3}{w - x}, $$

where Q is the point $(w, w^3)$. Letting $Q \to P$ makes the secant line PQ approach
the tangent line at P in the limit. The slope of the tangent at P is then

$$ \text{slope of tangent at } P = \lim_{w \to x} \frac{w^3 - x^3}{w - x}. $$

This is just the derivative at x:

$$ \frac{dx^3}{dx} = \lim_{w \to x} \frac{(w - x)(w^2 + wx + x^2)}{w - x} = \lim_{w \to x} (w^2 + wx + x^2) = 3x^2. $$ (13.9)
13.6 Derivative of $x^n$


The procedure used for $x^2$ and $x^3$ works for $x^n$, where n is any positive
integer. This leads to

$$ \frac{dx^n}{dx} = n x^{n-1}. $$ (13.10)


Thus, for example, the slope of the curve 


at the point (1, 1) is 
plese 


13.7 Derivative of $x^{-1} = 1/x$


For $1/x$ we have the graph in Figure 13.4.


Figure 13.4: Secant segment for $y = 1/x$ at $P(x, 1/x)$.


The derivative at x is the slope of the tangent at $P(x, 1/x)$:

$$ \frac{d(1/x)}{dx} = \lim_{Q \to P} (\text{slope of } PQ). $$
The slope of PQ, where Q is the point $(w, 1/w)$, is:

$$ \text{slope of } PQ = \frac{(1/w) - (1/x)}{w - x}. $$
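Then we can calculate the derivative as follows:

$$ \begin{aligned}
\frac{d(1/x)}{dx} &= \lim_{w \to x} \frac{(1/w) - (1/x)}{w - x} \\
&= \lim_{w \to x} \frac{(x - w)/(xw)}{w - x} \qquad \left(\text{using } \frac{1}{A} - \frac{1}{B} = \frac{B - A}{AB}\right) \\
&= \lim_{w \to x} \frac{x - w}{xw(w - x)} \\
&= \lim_{w \to x} \frac{-1}{xw} \\
&= -\frac{1}{x^2}.
\end{aligned} $$ (13.11)
Thus:
$$ \frac{d(1/x)}{dx} = -\frac{1}{x^2}. $$
This agrees with the formula (13.10), which would read
$$ \frac{dx^{-1}}{dx} = (-1) \cdot x^{-2}, $$
even though $n = -1$ is not a positive integer.


The negative sign on $-1/x^2$ indicates a downward sloping tangent.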

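13.8 Derivative of $x^{-k} = 1/x^k$


Let k be a positive integer and consider the function given by $x^{-k} = 1/x^k$ for $x \neq 0$.
We can calculate its derivative:

$$ \begin{aligned}
\frac{d(x^{-k})}{dx} &= \lim_{w \to x} \frac{(1/w^k) - (1/x^k)}{w - x} \\
&= \lim_{w \to x} \frac{(x^k - w^k)/(x^k w^k)}{w - x} \qquad \left(\text{using } \frac{1}{A} - \frac{1}{B} = \frac{B - A}{AB}\right) \\
&= \lim_{w \to x} \frac{x^k - w^k}{x^k w^k (w - x)} \\
&= \lim_{w \to x} (-1) \frac{w^k - x^k}{w - x} \cdot \frac{1}{x^k w^k} \\
&= -k x^{k-1} \cdot \frac{1}{x^k x^k},
\end{aligned} $$ (13.12)
where in the last step we used the derivative of $x^k$:
$$ \lim_{w \to x} \frac{w^k - x^k}{w - x} = k x^{k-1}. $$
Thus
$$ \frac{d(1/x^k)}{dx} = -k \frac{1}{x^{k+1}} = -k x^{-k-1}. $$ (13.13)


Writing n for $-k$ this reads

$$ \frac{dx^n}{dx} = n x^{n-1}, $$

correct again, even though n is now a negative integer.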


13.9 Derivative of $x^{1/2} = \sqrt{x}$


Consider the function
$$ s(x) = \sqrt{x} = x^{1/2} $$
defined for all $x \geq 0$. Consider any $p \geq 0$. Then the derivative of this function
at p is the slope of the tangent at $P(p, \sqrt{p})$ to the graph $y = \sqrt{x}$, and we
know that
$$ \text{slope of tangent at } P = \lim_{Q \to P} (\text{slope of } PQ). $$

124 Ambar N. Sengupta 11/6/2011 


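Figure 13.5: Secant segment for $y = \sqrt{x}$ at $P(p, \sqrt{p})$.


Taking Q to be the point $(w, \sqrt{w})$, this gives

$$ \begin{aligned}
s'(p) &= \lim_{w \to p} \frac{\sqrt{w} - \sqrt{p}}{w - p} \\
&= \lim_{w \to p} \frac{(\sqrt{w} - \sqrt{p})(\sqrt{w} + \sqrt{p})}{(w - p)(\sqrt{w} + \sqrt{p})} \\
&= \lim_{w \to p} \frac{w - p}{(w - p)(\sqrt{w} + \sqrt{p})} \qquad (\text{using } (A - B)(A + B) = A^2 - B^2) \\
&= \lim_{w \to p} \frac{1}{\sqrt{w} + \sqrt{p}} \\
&= \frac{1}{2\sqrt{p}}
\end{aligned} $$ (13.14)

when $p > 0$. If $p = 0$ we can just set $p = 0$ in the preceding calculations
except for the very last line, being careful to note that the values w we work
with are $> 0$ (because s(w) is not defined for $w < 0$); thus,

$$ s'(0) = \lim_{w \to 0+} \frac{1}{\sqrt{w} + \sqrt{0}} = \infty. $$
Thus the derivative of $\sqrt{x}$ is
$$ \frac{d\sqrt{x}}{dx} = \frac{1}{2\sqrt{x}} $$ (13.15)

when $x > 0$, and is $\infty$ when $x = 0$.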
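The infinite slope at 0 can be seen numerically: the secant slopes through the origin equal $1/\sqrt{w}$ and grow without bound as w shrinks. A short sketch, with the helper name `secant_slope_at_zero` chosen for the example:

```python
import math

def secant_slope_at_zero(w):
    """Slope of the secant of y = sqrt(x) through (0, 0) and (w, sqrt(w)), for w > 0."""
    return (math.sqrt(w) - math.sqrt(0.0)) / (w - 0.0)

for w in (1e-2, 1e-4, 1e-6, 1e-8):
    # The two computed columns agree (up to rounding) and grow as w approaches 0 from the right.
    print(w, secant_slope_at_zero(w), 1 / math.sqrt(w))
```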
Note that
$$ \sqrt{x} = x^{1/2} \quad\text{and}\quad \frac{1}{\sqrt{x}} = x^{-1/2}, $$

and so the right side in (13.15) is

$$ \frac{1}{2} x^{-1/2}. $$
Thus
$$ \frac{dx^{1/2}}{dx} = \frac{1}{2} x^{-1/2}, $$ (13.16)
again agreeing with
$$ \frac{dx^n}{dx} = n x^{n-1}, $$

now with $n = 1/2$.


13.10 Derivatives of powers of x 


Let $r = p/q$ be a rational number, where p and q are integers, and $q \neq 0$.
Then for any $x > 0$ the power $x^{p/q}$ is the non-negative real number whose
q-th power is $x^p$:

$$ (x^{p/q})^q = x^p. $$
The existence of such a real number $x^r$ follows from the intermediate value
theorem, and ultimately is a consequence of the completeness of the real line.

The following derivative formula holds:
$$ \frac{dx^r}{dx} = r x^{r-1}, $$ (13.17)
and can be proved by extension of the methods used before for negative
powers of x and for $x^{1/2}$.
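The power formula can be spot-checked numerically for several exponents. The sketch below uses a symmetric (central) difference quotient, which is a swapped-in approximation device rather than the one-sided secant slopes used in the text; the helper name `central_difference` and the step size are assumptions.

```python
def central_difference(f, x, h=1e-6):
    """Symmetric difference quotient, a numerical stand-in for the derivative at x."""
    return (f(x + h) - f(x - h)) / (2 * h)

def power_rule_gap(r, x):
    """Difference between the numerical derivative of t**r at x and r * x**(r - 1)."""
    numeric = central_difference(lambda t: t ** r, x)
    formula = r * x ** (r - 1)
    return numeric - formula

x = 2.0
for r in (3, -1, -2, 0.5, 0.75):
    print(r, power_rule_gap(r, x))   # each gap is tiny compared with the derivative itself
```

This is a check of the formula at sample points, not a proof of it.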


13.11 Derivatives with infinities 


It does not seem useful to bother defining the derivative $F'(\infty)$ for a function
F defined at $\infty$. However, a natural extension of the geometric intuition of the
derivative in terms of tangent lines leads to the definition

$$ F'(p) = \lim_{x \to p} \frac{F(x)}{x} \quad\text{if } p \in \{-\infty, \infty\}. $$ (13.18)
If F(x) approaches a finite limit $F(\infty)$ as $x \to \infty$, then $F'(\infty)$ is 0, which
conforms to intuition: the tangent line at $x = \infty$ is the ‘horizontal’ line
$y = F(\infty)$.


Chapter 14 


Derivatives of Trigonometric 
Functions 


In this chapter we work out the derivatives of sin, cos, and tan, by using their
algebraic properties and the fundamental limits

$$ \lim_{\theta \to 0} \frac{\sin\theta}{\theta} = 1 \quad\text{and}\quad \lim_{\theta \to 0} \frac{\tan\theta}{\theta} = 1. $$


14.1 Derivative of sin is cos 


The derivative of the function sin at $x \in \mathbb{R}$ is the slope of the graph of

$$ y = \sin x $$
at the point $P(x, \sin x)$. Thus it is:
$$ \sin' x = \lim_{w \to x} (\text{slope of } PQ), $$
where Q is the point $(w, \sin w)$.
Now the slope of PQ is
$$ \text{slope of } PQ = \frac{\sin w - \sin x}{w - x}. $$

We have then

$$ \sin' x = \lim_{w \to x} \frac{\sin w - \sin x}{w - x}. $$


Figure 14.1: Secant segment for $y = \sin x$ at $P(x, \sin x)$.


To work this out there are two possible routes. We follow one, using the
relation (8.22), which implies:

$$ \sin w - \sin x = 2 \sin\frac{w - x}{2} \cos\frac{w + x}{2}. $$ (14.1)
Using this we have
$$ \begin{aligned}
\sin' x &= \lim_{w \to x} \frac{2 \sin\frac{w - x}{2} \cos\frac{w + x}{2}}{w - x} \\
&= \lim_{w \to x} \frac{2 \sin\frac{w - x}{2}}{w - x} \cos\frac{w + x}{2}.
\end{aligned} $$ (14.2)

To make this look more like something involving $\sin\theta/\theta$ (whose limit we
understand), we write this as

$$ \begin{aligned}
\sin' x &= \lim_{w \to x} \frac{\sin\frac{w - x}{2}}{\frac{w - x}{2}} \cos\frac{w + x}{2} \\
&= 1 \cdot \cos\frac{x + x}{2} \\
&= \cos x.
\end{aligned} $$ (14.3)
Thus we have found the derivative of sin:
$$ \frac{d \sin x}{dx} = \sin' x = \cos x. $$ (14.4)
14.2 Derivative of cos is $-\sin$


The derivative of the function cos at $x \in \mathbb{R}$ is

$$ \cos' x = \lim_{w \to x} \frac{\cos w - \cos x}{w - x}. $$

Recall the relation (8.23):

$$ \cos w - \cos x = -2 \sin\frac{w - x}{2} \sin\frac{w + x}{2}. $$ (14.5)

Using this we have

$$ \begin{aligned}
\cos' x &= -\lim_{w \to x} \frac{2 \sin\frac{w - x}{2} \sin\frac{w + x}{2}}{w - x} \\
&= -\lim_{w \to x} \frac{\sin\frac{w - x}{2}}{\frac{w - x}{2}} \sin\frac{w + x}{2} \\
&= -\sin x.
\end{aligned} $$ (14.6)

Thus we have found the derivative of cos:

$$ \frac{d \cos x}{dx} = \cos' x = -\sin x. $$ (14.7)


14.3. Derivative of tan is $\sec^2$


The derivative of the function tan at a point x in its domain is

$$ \tan' x = \lim_{w \to x} \frac{\tan w - \tan x}{w - x}. $$

Recall the relation (8.24):

$$ \tan(a - b) = \frac{\tan a - \tan b}{1 + \tan a \tan b}. $$
From this we have

$$ \tan a - \tan b = (1 + \tan a \tan b) \tan(a - b). $$

Taking w for a and x for b we have

$$ \tan w - \tan x = (1 + \tan w \tan x) \tan(w - x). $$

We can proceed to the derivative now:

$$ \begin{aligned}
\tan' x &= \lim_{w \to x} \frac{\tan w - \tan x}{w - x} \\
&= \lim_{w \to x} \frac{(1 + \tan w \tan x) \tan(w - x)}{w - x} \\
&= \lim_{w \to x} (1 + \tan w \tan x) \frac{\tan(w - x)}{w - x} \\
&= (1 + \tan x \tan x) \cdot 1,
\end{aligned} $$ (14.8)

because

$$ \lim_{\theta \to 0} \frac{\tan\theta}{\theta} = 1. $$
Thus,

$$ \tan' x = 1 + \tan^2 x. $$
Now recall that $1 + \tan^2 x$ is $\sec^2 x$. Hence

$$ \frac{d \tan x}{dx} = \tan' x = \sec^2 x. $$ (14.9)
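The three derivative formulas of this chapter can be checked numerically by comparing secant slopes with the claimed derivatives. The sketch below is only an illustration; the helper name `difference_quotient` and the sample point are assumptions.

```python
import math

def difference_quotient(f, x, w):
    """Slope of the secant through (x, f(x)) and (w, f(w)), for w != x."""
    return (f(w) - f(x)) / (w - x)

x = 1.0
for h in (1e-1, 1e-3, 1e-5):
    w = x + h
    # Each printed gap shrinks towards 0 as w approaches x.
    print(h,
          difference_quotient(math.sin, x, w) - math.cos(x),
          difference_quotient(math.cos, x, w) + math.sin(x),
          difference_quotient(math.tan, x, w) - 1 / math.cos(x) ** 2)
```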


Chapter 15 


Differentiability and Continuity 


15.1 Differentiability implies continuity 


We know that not all continuous functions are differentiable. For example,
the absolute value function is continuous but is not differentiable at 0.


However, if a function is differentiable then it is continuous:


Theorem 15.1.1 Suppose f is a function defined on a set $S \subset \mathbb{R}$ and is
differentiable at a point $p \in S$. Then f is continuous at p.


Proof. Recall that the derivative $f'(p)$ is given by

$$ f'(p) = \lim_{x \to p} \frac{f(x) - f(p)}{x - p}. $$

Note that this is meaningful only when $p \in S$ is not an isolated point of S.
We wish to show that $f(x) \to f(p)$ when $x \to p$. For this we first write f(x)
as f(p) plus the amount it deviates from f(p):

$$ f(x) = f(p) + f(x) - f(p). $$

Since we have information about $\frac{f(x) - f(p)}{x - p}$ let us bring this in:

$$ f(x) = f(p) + (x - p) \left[ \frac{f(x) - f(p)}{x - p} \right]. $$

Now let $x \to p$:

$$ \lim_{x \to p} f(x) = f(p) + 0 \cdot f'(p) = f(p), $$

where we used the given fact that $f'(p)$ exists and is finite (the application
of the ‘limit of product equals product of limits’ argument would fail if $f'(p)$
were $\infty$ or $-\infty$). QED

We have allowed infinite values for derivatives. For example, the function 
f specified by 


$$f(x) = \begin{cases} -1 & \text{if } x < 0;\\ 0 & \text{if } x = 0;\\ 1 & \text{if } x > 0;\end{cases}$$

has derivative

$$f'(0) = \infty.$$


But note that f is not continuous at 0. 
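
To see this infinite derivative concretely, one can tabulate the difference quotient of the step function at 0. This is a small illustrative sketch; the names `step` and `difference_quotient` are ours.

```python
def step(x):
    """The function f above: -1 for x < 0, 0 at x = 0, and 1 for x > 0."""
    if x < 0:
        return -1.0
    if x > 0:
        return 1.0
    return 0.0

def difference_quotient(f, p, x):
    """The ratio (f(x) - f(p)) / (x - p) whose limit as x -> p is f'(p)."""
    return (f(x) - f(p)) / (x - p)

# Approaching 0 from either side, the quotient equals 1/|x| and grows without bound.
for x in (0.1, -0.1, 0.001, -0.001, 1e-6, -1e-6):
    print(f"x = {x:10.6f}   quotient = {difference_quotient(step, 0.0, x):.1f}")
```

The quotients are positive on both sides of 0 and increase without bound, consistent with $f'(0) = \infty$, even though $f$ itself jumps at 0.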

There are plenty of functions that are continuous everywhere and yet 
differentiable nowhere. They are hard to visualize and have severely zig-zag 
graphs. 


Chapter 16 


Using the Algebra of 
Derivatives 


In this chapter we use the basic algebraic rules for working with derivatives. 
In summary the basic algebraic rules are 


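$$\begin{aligned}
\frac{d(U+V)}{dx} &= \frac{dU}{dx} + \frac{dV}{dx}\\
\frac{d(kU)}{dx} &= k\frac{dU}{dx}\\
\frac{d(UV)}{dx} &= \frac{dU}{dx}V + U\frac{dV}{dx} && \text{(product rule)}\\
\frac{d(1/V)}{dx} &= -\frac{1}{V^2}\frac{dV}{dx}\\
\frac{d(U/V)}{dx} &= \frac{V\frac{dU}{dx} - U\frac{dV}{dx}}{V^2} && \text{(quotient rule)}
\end{aligned} \tag{16.1}$$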


where U and V are differentiable functions for which all the quantities on 
the right sides exist and are finite, and k is any constant (real number). We 
have retained some redundancy here; for example, that d(kU)/dx = kdU/dx 
can be deduced from the ‘product rule’ for d(UV)/dx, keeping in mind that 
dk /dx = 0. 
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16.1 Using the sum rule 


The sum rule is so easy to use that it is best forgotten that we are actually 
using a rule (that needs to be proved). For example 


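$$\begin{aligned}
\frac{d(5\sin x + 4x^3 - 3)}{dx} &= \frac{d(5\sin x)}{dx} + \frac{d(4x^3-3)}{dx}\\
&\qquad\text{(assuming these derivatives exist and are finite)}\\
&= 5\frac{d\sin x}{dx} + \frac{d(4x^3)}{dx} + \frac{d(-3)}{dx}\\
&\qquad\text{(assuming these derivatives exist and are finite)}\\
&= 5\cos x + 12x^2 + 0\\
&= 5\cos x + 12x^2.
\end{aligned}$$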


For this very first example we were careful to state most of the steps and 
logic, but there is no need to be so extreme. Mostly we can write down the 
derivative of a sum directly: 


$$\frac{d(5\sin x - 3x^5 + 2x - 4)}{dx} = 5\cos x - 15x^4 + 2.$$


16.2 Using the product rule 


The product rule gets us to more complicated functions. Take for example, 
$$y = \sqrt{x}\,\sin x.$$

Using the product rule we find the derivative very easily:

$$\frac{dy}{dx} = \frac{d\sqrt{x}}{dx}\sin x + \sqrt{x}\,\frac{d\sin x}{dx} = \frac{1}{2\sqrt{x}}\sin x + \sqrt{x}\cos x.$$
We can use the product rule for multiple products: 


$$\begin{aligned}
\frac{d(UVW)}{dx} &= \frac{d\,(UV)(W)}{dx}\\
&= \frac{d(UV)}{dx}W + UV\frac{dW}{dx}\\
&= \frac{dU}{dx}VW + U\frac{dV}{dx}W + UV\frac{dW}{dx}.
\end{aligned} \tag{16.2}$$

Clearly this pattern works for products of any number of factors.
Here is another example of the product rule:


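$$\begin{aligned}
&\frac{d\,(t+\tan t)(2t^2+\sqrt{t})(4t-\cos t)}{dt}\\
&\quad= \frac{d(t+\tan t)}{dt}(2t^2+\sqrt{t})(4t-\cos t) + (t+\tan t)\frac{d(2t^2+\sqrt{t})}{dt}(4t-\cos t)\\
&\qquad + (t+\tan t)(2t^2+\sqrt{t})\frac{d(4t-\cos t)}{dt}\\
&\quad= (1+\sec^2 t)(2t^2+\sqrt{t})(4t-\cos t) + (t+\tan t)\left(4t+\frac{1}{2\sqrt{t}}\right)(4t-\cos t)\\
&\qquad + (t+\tan t)(2t^2+\sqrt{t})(4+\sin t).
\end{aligned}$$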


16.3 Using the quotient rule 


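Let us first see quickly how to use the formula

$$(1/V)' = -\frac{V'}{V^2}.$$

Here is an example:

$$\begin{aligned}
\frac{d}{dw}\left(\frac{1}{\sin w+\cos w+1}\right) &= -\frac{1}{(\sin w+\cos w+1)^2}\,\frac{d(\sin w+\cos w+1)}{dw}\\
&= -\frac{\cos w-\sin w}{(\sin w+\cos w+1)^2}.
\end{aligned}$$

Now let us do an example of the full quotient rule

$$(U/V)' = \frac{VU' - UV'}{V^2}.$$

We have

$$\begin{aligned}
\frac{d}{dw}\left(\frac{w+\tan w}{\sin w+\cos w+1}\right) &= \frac{(\sin w+\cos w+1)\frac{d(w+\tan w)}{dw} - (w+\tan w)\frac{d(\sin w+\cos w+1)}{dw}}{(\sin w+\cos w+1)^2}\\
&= \frac{(\sin w+\cos w+1)(1+\sec^2 w) - (w+\tan w)(\cos w-\sin w)}{(\sin w+\cos w+1)^2}.
\end{aligned}$$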
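
The quotient rule computation above can be checked numerically. This is a sketch under our own naming (`quotient`, `quotient_rule_derivative`, `central_difference` are illustrative helpers), comparing the formula with a symmetric difference quotient.

```python
import math

def quotient(w):
    """The function (w + tan w) / (sin w + cos w + 1)."""
    return (w + math.tan(w)) / (math.sin(w) + math.cos(w) + 1)

def quotient_rule_derivative(w):
    """(V U' - U V') / V^2 with U = w + tan w and V = sin w + cos w + 1."""
    U = w + math.tan(w)
    dU = 1 + 1 / math.cos(w) ** 2          # 1 + sec^2 w
    V = math.sin(w) + math.cos(w) + 1
    dV = math.cos(w) - math.sin(w)
    return (V * dU - U * dV) / V ** 2

def central_difference(f, x, h=1e-6):
    """Symmetric difference quotient approximating f'(x)."""
    return (f(x + h) - f(x - h)) / (2 * h)

for w in (0.3, 1.0):
    print(w, quotient_rule_derivative(w), central_difference(quotient, w))
```

At each sample point the two printed values should agree to several decimal places.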
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Chapter 17 


Using the Chain Rule 


The chain rule, along with the algebraic rules we have already studied, makes 
it possible to work out derivatives of functions given by highly complicated 
expressions. 


17.1 Initiating examples 


Before stating the chain rule in the abstract we can see its essence in a few 
basic examples. As a first example, look at 


$$\frac{d\sin(x^2+3x)}{dx} = [\cos(x^2+3x)]\,(2x+3).$$

The sin has become cos (which is the derivative of sin), and then we have a
multiplier $2x+3$ which we recognize to be the derivative of $x^2+3x$.
Next look at 


$$\frac{d(4t^3-2\sin t)^{1/3}}{dt} = \frac{1}{3}(4t^3-2\sin t)^{1/3-1}\cdot(12t^2-2\cos t).$$

We recognize the first factor on the right as the derivative of the function $(\cdot)^{1/3}$,
evaluated at $4t^3-2\sin t$; next, this is multiplied by the derivative of
$4t^3-2\sin t$.


As our last initiating example, look at 


$$\frac{d\sin\left(\sqrt{x^4+x^2}\right)}{dx} = \left[\cos\left(\sqrt{x^4+x^2}\right)\right]\cdot\frac{1}{2\sqrt{x^4+x^2}}\cdot(4x^3+2x).$$




138 Ambar N. Sengupta 11/6/2011 


This is a chain rule applied twice: first the sin is differentiated to obtain
$\cos(\cdot)$, next $\sqrt{\cdot}$ is differentiated to produce $\frac{1}{2\sqrt{\cdot}}$ and, finally, $x^4+x^2$ is
differentiated to produce the last factor $4x^3+2x$.


17.2. The chain rule 


Consider a function $H$ that is the composite of functions $F$ and $G$:

$$H(x) = F(G(x)).$$

This means that to calculate the value $H(x)$ we must first work out the value
$G(x)$ and then apply the function $F$ to it. For example, the function given by

$$\sqrt{x^4+x^2}$$

is the composite of the square root function $\sqrt{\cdot}$ with the function given by
$x^4+x^2$. As another example,

$$\sin\sqrt{w}$$

is the composite of $\sin$ with $\sqrt{\cdot}$.


The composite 
H=FoG 


is the function whose value is given by 
H(t) = F(G(t)), 


for every value t for which G(t) is defined and F(G(t)) is also defined. For 
example, the composite function given by 


$$\sqrt{1+x}$$

is defined only for $x \ge -1$.


DRAFT Calculus Notes 11/17/2011 139 


The chain rule says that if 
$$H = F\circ G$$

then

$$H'(x) = F'(G(x))\,G'(x) \tag{17.1}$$


provided that values and derivatives on the right exist and are finite. 
Returning to examples, we have then 


$$\frac{d\tan\sqrt{x}}{dx} = \left[\sec^2\sqrt{x}\right]\cdot\frac{1}{2\sqrt{x}}$$

because we recognize that $\tan\sqrt{x}$ is obtained by applying $\tan$ to $\sqrt{x}$:

$$\tan\sqrt{x} = (\tan\circ\sqrt{\cdot})(x).$$
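
The chain rule (17.1) translates directly into a small computation. The sketch below is illustrative only: `chain_rule` and `central_difference` are our own helper names, and the derivatives of the outer and inner functions are supplied by hand.

```python
import math

def chain_rule(dF, G, dG, x):
    """H'(x) = F'(G(x)) * G'(x) for the composite H = F o G."""
    return dF(G(x)) * dG(x)

def central_difference(f, x, h=1e-6):
    """Symmetric difference quotient approximating f'(x)."""
    return (f(x + h) - f(x - h)) / (2 * h)

# Example 1: H(x) = tan(sqrt(x)), with F = tan and G = sqrt.
sec2 = lambda u: 1 / math.cos(u) ** 2
d_sqrt = lambda x: 1 / (2 * math.sqrt(x))
x = 0.7
print(chain_rule(sec2, math.sqrt, d_sqrt, x),
      central_difference(lambda t: math.tan(math.sqrt(t)), x))

# Example 2: the chain rule applied twice to sin(sqrt(x^4 + x^2)).
inner = lambda x: x ** 4 + x ** 2
d_inner = lambda x: 4 * x ** 3 + 2 * x
root = lambda x: math.sqrt(inner(x))
d_root = lambda x: chain_rule(d_sqrt, inner, d_inner, x)
x = 0.8
print(chain_rule(math.cos, root, d_root, x),
      central_difference(lambda t: math.sin(root(t)), x))
```

In each printed pair the chain rule value and the difference quotient should agree closely.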
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Chapter 18 


Proving the Algebra of 
Derivatives 


We now explore precise statements and proofs for the rules of algebra for 
derivatives. 


18.1 Sums 


If $f$ and $g$ are functions with domain $S \subset \mathbb{R}$ then their sum is the function
$f+g$ on $S$ whose value at any $x \in S$ is given by

$$(f+g)(x) = f(x) + g(x).$$

Suppose now that $p \in S$ is a point where the derivatives $f'(p)$ and $g'(p)$ exist.
Then

$$(f+g)'(p) = f'(p) + g'(p) \tag{18.1}$$

if this sum is defined (that is, not $\infty + (-\infty)$ or $(-\infty) + \infty$).
This result follows directly from the fact that the limit of a sum is the 
sum of the limits. 


18.2 Products 


If $f$ and $g$ are functions with domain $S \subset \mathbb{R}$ then their product is the function
$fg$ on $S$ whose value at any $x \in S$ is given by

$$(fg)(x) = f(x)g(x).$$
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For proving the product rule we have to bring in a useful little geometric 
observation about products. Notice that if a rectangle, whose sides are $A$
and $B$, gets enlarged so that it becomes a $C$ by $D$ rectangle then its area
increases by

$$\begin{aligned}
CD - AB &= CD - AD + AD - AB\\
&= (C-A)D + A(D-B)\\
&= (C-A)(D-B+B) + A(D-B)\\
&= (C-A)(D-B) + (C-A)B + A(D-B).
\end{aligned} \tag{18.2}$$


Proposition 18.2.1 Let $f$ and $g$ be functions on a set $S \subset \mathbb{R}$, and let $p \in S$
be a point where $f$ and $g$ are both differentiable (that is, the derivatives $f'(p)$
and $g'(p)$ exist and are finite). Then

$$(fg)'(p) = f'(p)g(p) + f(p)g'(p). \tag{18.3}$$

Proof. To keep the notation light we write $x$ for the point $p$. Recall that for any function $h$ the derivative $h'(x)$ is defined to be

$$h'(x) = \lim_{w\to x}\frac{h(w)-h(x)}{w-x}.$$

Let us apply this to $h = fg$. Then

$$(fg)'(x) = \lim_{w\to x}\frac{f(w)g(w) - f(x)g(x)}{w-x}. \tag{18.4}$$

Now we split the numerator following the idea of (18.2):

$$f(w)g(w) - f(x)g(x) = [f(w)-f(x)][g(w)-g(x)] + [f(w)-f(x)]\,g(x) + f(x)\,[g(w)-g(x)].$$

Now divide by $w - x$ to obtain

$$\frac{f(w)g(w) - f(x)g(x)}{w-x} = \frac{[f(w)-f(x)][g(w)-g(x)]}{w-x} + \frac{f(w)-f(x)}{w-x}\,g(x) + f(x)\,\frac{g(w)-g(x)}{w-x}.$$

The ratio $\frac{f(w)-f(x)}{w-x}$ approaches the derivative $f'(x)$ when $w \to x$, and similarly
for $\frac{g(w)-g(x)}{w-x}$, so we rewrite everything in terms of these 'difference quotients':

$$\frac{f(w)g(w) - f(x)g(x)}{w-x} = (w-x)\,\frac{f(w)-f(x)}{w-x}\cdot\frac{g(w)-g(x)}{w-x} + \frac{f(w)-f(x)}{w-x}\,g(x) + f(x)\,\frac{g(w)-g(x)}{w-x}.$$

Now just let $w \to x$:

$$\lim_{w\to x}\frac{f(w)g(w) - f(x)g(x)}{w-x} = 0\cdot f'(x)\cdot g'(x) + f'(x)g(x) + f(x)g'(x),$$

which works because the derivatives $f'(x)$ and $g'(x)$ have been assumed to
exist and be finite. Thus $fg$ is differentiable at $x$ and

$$(fg)'(x) = f'(x)g(x) + f(x)g'(x). \qquad\text{QED}$$


18.3 Quotients 


We turn now to proving the quotient rule: 


$$(f/g)'(x) = \frac{g(x)f'(x) - f(x)g'(x)}{g(x)^2}, \tag{18.5}$$

valid whenever $f$ and $g$ are functions defined on some common domain $S$
and $x$ is a point of $S$ where $f$ and $g$ are both differentiable and $g(x) \ne 0$.
From the definition of the derivative, $(f/g)'(x)$ is the limit of

$$\frac{\frac{f(w)}{g(w)} - \frac{f(x)}{g(x)}}{w-x} \tag{18.6}$$

as $w \to x$. Let us rework the numerator so it involves mainly the differences
$f(w)-f(x)$ and $g(w)-g(x)$:

$$\begin{aligned}
\frac{f(w)}{g(w)} - \frac{f(x)}{g(x)} &= \frac{f(w)g(x) - f(x)g(w)}{g(w)g(x)}\\
&= \frac{[f(w)-f(x)+f(x)]\,g(x) - f(x)\,[g(w)-g(x)+g(x)]}{g(w)g(x)}\\
&= \frac{[f(w)-f(x)]\,g(x) - f(x)\,[g(w)-g(x)]}{g(w)g(x)}.
\end{aligned}$$

The preceding algebra may look very complicated at first but it has a natural
and simple thinking behind it: at each stage where we see $f(w)$ we write it
in terms of the difference $f(w)-f(x)$ as $f(w)-f(x)+f(x)$, and the same
for $g(w)$.

Thus

$$\frac{\frac{f(w)}{g(w)} - \frac{f(x)}{g(x)}}{w-x} = \frac{\frac{f(w)-f(x)}{w-x}\,g(x) - f(x)\,\frac{g(w)-g(x)}{w-x}}{g(w)g(x)}. \tag{18.7}$$

Now we let $w \to x$. To deal with the denominator term $g(w)$, we need to
use the fact that the differentiability of $g$ at $x$ makes it continuous at $x$:

$$g(w) \to g(x),$$

as $w \to x$. Applying this to the identity (18.7) produces

$$\lim_{w\to x}\frac{\frac{f(w)}{g(w)} - \frac{f(x)}{g(x)}}{w-x} = \frac{f'(x)g(x) - f(x)g'(x)}{g(x)g(x)},$$

which is what we wished to prove. QED

If in the quotient rule we take the numerator to be the constant function
1 we obtain

$$(1/V)' = \frac{V\cdot 0 - 1\cdot V'}{V^2} = -\frac{1}{V^2}V',$$


a useful formula in itself. 


Chapter 19 


Proving the Chain Rule 


It is easy to understand why the chain rule works. But, as we shall see, 
turning this easy understanding into a proof runs into a snag. For the official 
proof we then follow a different line of reasoning. 


19.1 Why it works 
Consider the composite function $F\circ G$:

$$y = F(G(x)).$$

To work out the derivative $dy/dx$, let us introduce some notation:

$$y = F(u) \quad\text{where } u = G(x).$$

Then

$$y = F(G(x)) = H(x),$$

and

$$\frac{\Delta y}{\Delta x} = \frac{\Delta y}{\Delta u}\,\frac{\Delta u}{\Delta x}. \tag{19.1}$$

Letting $\Delta x \to 0$ gives

$$\frac{dy}{dx} = \frac{dy}{du}\,\frac{du}{dx}. \tag{19.2}$$


For example, for the function

$$y = \sqrt{1+\sin x}$$

we take $u = 1+\sin x$, and then

$$y = \sqrt{u} \quad\text{and}\quad u = 1+\sin x,$$

so that

$$\frac{dy}{dx} = \frac{dy}{du}\,\frac{du}{dx} = \frac{1}{2\sqrt{u}}\cdot\cos x = \frac{1}{2\sqrt{1+\sin x}}\cos x.$$
This is clearly the chain rule as we have seen and used before. 

The argument used above is natural but has one technical gap. What if 
the denominator $\Delta u = \Delta G(x)$ keeps hitting 0? Then the first ratio on the
right in (19.1) is undefined and the argument breaks down.

There are two ways to deal with this road block. Either we can try to 
tread through the obstacle very carefully, or we can try an entirely different 
pathway, one that is less natural but one that gets us to the destination faster. 
We will go through such a proof in the next section. This is a situation one 
sometimes faces in trying to construct a proof out of a reasonable idea. In 
the end, however, the ‘reasonable idea’ often gives better insight into ‘why’ 
the result is true. (In fact, the idea does provide a proof in the case G’(x) is 
not 0.) 


19.2 Proof of the chain rule


A proof of the chain rule can be built out of the following useful observation: 


Lemma 19.2.1 Let $f$ be a function, with domain $S \subset \mathbb{R}$, differentiable at a
point $p \in S$. Then there is a function $f_p$ on $S$, which is continuous at $p$, and
for which

$$f(x) = f(p) + (x-p)f_p(x) \quad\text{for all } x \in S, \tag{19.3}$$

and

$$f_p(p) = f'(p).$$

There is a more enlightening way to state (19.3):

$$f(x) = f(p) + (x-p)f'(p) + \epsilon_p(x) \quad\text{for all } x \in S, \tag{19.4}$$

where

$$\lim_{x\to p}\frac{\epsilon_p(x)}{x-p} = 0, \tag{19.5}$$

because $\epsilon_p(x) = (x-p)[f_p(x) - f'(p)]$. To understand the significance of
(19.4) observe that the first two terms on the right describe the $y$-value of the
tangent line to $y = f(x)$ at $p$, and so condition (19.5) says:

$$\lim_{x\to p}\frac{f(x) - T_pf(x)}{x-p} = 0, \tag{19.6}$$

where

$$y = T_pf(x) \overset{\text{def}}{=} f(p) + (x-p)f'(p)$$

is the equation of the tangent to the graph of $f$ at $(p, f(p))$.

Figure 19.1: The tangent as an approximation to the graph (the tangent line is $y = f(p) + (x-p)f'(p)$).


Proof. Simply note that 


$$f(x) = f(p) + f(x) - f(p) = f(p) + (x-p)\left[\frac{f(x)-f(p)}{x-p}\right], \tag{19.7}$$

for all $x \in S$ with $x \ne p$. We want to denote the ratio $\frac{f(x)-f(p)}{x-p}$ by $f_p(x)$.
However, we need to say what $f_p(p)$ is, for at $x = p$ the ratio $\frac{f(x)-f(p)}{x-p}$ is
not defined. But we do know that as $x \to p$, the ratio $\frac{f(x)-f(p)}{x-p}$ approaches
$f'(p)$. So let us define the function $f_p$ on $S$ by

$$f_p(x) = \begin{cases} \dfrac{f(x)-f(p)}{x-p} & \text{if } x \in S \text{ and } x \ne p;\\[2mm] f'(p) & \text{if } x = p.\end{cases} \tag{19.8}$$

Then

$$\lim_{x\to p} f_p(x) = \lim_{x\to p}\frac{f(x)-f(p)}{x-p} = f'(p) = f_p(p),$$

which shows that $f_p$ is continuous at $p$. Moreover, from (19.7),

$$f(x) = f(p) + (x-p)f_p(x)$$

for $x \in S$ with $x \ne p$. But putting in $x = p$ shows that this is also valid
when $x = p$. QED
Now we can prove the chain rule.
Now we can prove the chain rule. 


Proposition 19.2.1 Let $f$ and $g$ be functions on subsets of $\mathbb{R}$, and let $S_0$ be
the set of all $x$ for which the composite

$$f\circ g : x \mapsto f(g(x))$$

is defined. Suppose that $p$ is a point of $S_0$ such that $g$ is differentiable at $p$
and $f$ is differentiable at $g(p)$. Assume also that $p$ is not an isolated point of
$S_0$. Then the function $f\circ g$ is differentiable at $p$ and

$$(f\circ g)'(p) = f'(g(p))\,g'(p). \tag{19.9}$$

We can avoid all the worrying about domains if we simply assume $f\circ g$
is defined in a neighborhood of $p$. Thus, if $f\circ g$ is defined in a neighborhood
of $p$, $g$ is differentiable at $p$ and $f$ is differentiable at $g(p)$ then $f\circ g$ is
differentiable at $p$ and (19.9) holds.

Proof. Let 
q = g(p). 


Recall from Lemma 19.2.1 the functions $f_q$ and $g_p$. Then

$$\begin{aligned}
f(g(x)) &= f(q) + (g(x) - q)\,f_q(g(x))\\
&= f(q) + \left(g(p) + (x-p)g_p(x) - q\right) f_q(g(x))\\
&= f(q) + (x-p)\,g_p(x)\,f_q(g(x))
\end{aligned} \tag{19.10}$$

for all $x$ in the domain of $f\circ g$. Then

$$\frac{f(g(x)) - f(g(p))}{x-p} = g_p(x)\,f_q(g(x))$$

for all $x \in S_0$ with $x \ne p$. Now letting $x \to p$ (recall that $p$ has been assumed
to be a limit point of $S_0$) we have

$$\begin{aligned}
\lim_{x\to p}\frac{f(g(x)) - f(g(p))}{x-p} &= \lim_{x\to p} g_p(x)\,f_q(g(x))\\
&= g_p(p)\,f_q(g(p))\\
&= g'(p)\,f'(g(p)),
\end{aligned}$$

where in the last step we used several observations and facts: (i) because $g_p$ is
continuous at $p$ the limit $\lim_{x\to p} g_p(x)$ is $g_p(p)$, (ii) $f_q$ is continuous at $q$ and $g$
is continuous at $p$ (because $g$ is differentiable at $p$) and so $\lim_{x\to p} f_q(g(x)) =
f_q(g(p))$; (iii) both $g_p(p) = g'(p)$ and $f_q(g(p)) = f_q(q) = f'(q)$ are finite (real
numbers).
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Chapter 20 


Using Derivatives for Extrema 


Derivatives can be used to find maximum and minimum values of functions.
For example, we will see soon how to find maximum and minimum values of

$$g(x) = 2x^3 - 6x^2 + 1$$

for $x \in [-1,1]$.
For a function $f$ on a closed interval $[a,b]$, where $a, b \in \mathbb{R}$ and $a < b$, here
is the strategy for finding maximum and minimum values:


(i) work out the derivative f’(x) and find the values of x in the interval 
[a, b] where f’(x) is 0; 


(ii) the maximum (minimum) of f is the largest (smallest) of the values of 
f at the points where f’ is 0 and the endpoint values f(a) and f(b), 


assuming that f is continuous on [a,b] and is differentiable in the interior 
(a,b) (or at least at those points in the interior where f reaches maxi- 
mum/minimum value). 

To see how this works let us apply it to the function $g(x) = 2x^3 - 6x^2 + 1$
for $x \in [-1,1]$. The derivative is

$$g'(x) = 6x^2 - 12x = 6x(x-2).$$

This is 0 at $x = 0$ and at $x = 2$. Since $x = 2$ is outside the domain $[-1,1]$,
we ignore it. Now we compute the values of $g$ at $x = 0$ and at the endpoints
$1$ and $-1$:

$$g(0) = 1, \qquad g(1) = -3, \qquad g(-1) = -7.$$
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The largest value of g(x) is therefore 1, occurring at x = 0, and the smallest 
value is —7, occurring at x = —1. 
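
The two-step strategy can be written out as a short routine. This is a minimal sketch: the function name `closed_interval_extrema` is ours, and the zeros of the derivative are assumed to have been found by hand and passed in.

```python
def closed_interval_extrema(f, critical_points, a, b):
    """Compare f at the endpoints a, b and at the zeros of f' lying in [a, b].

    Returns ((x_max, f(x_max)), (x_min, f(x_min))).
    """
    candidates = [a, b] + [c for c in critical_points if a <= c <= b]
    values = {x: f(x) for x in candidates}
    x_max = max(values, key=values.get)
    x_min = min(values, key=values.get)
    return (x_max, values[x_max]), (x_min, values[x_min])

def g(x):
    return 2 * x ** 3 - 6 * x ** 2 + 1

# g'(x) = 6x(x - 2) vanishes at 0 and 2; the point 2 is discarded automatically.
maximum, minimum = closed_interval_extrema(g, [0.0, 2.0], -1.0, 1.0)
print(maximum)  # (0.0, 1.0)
print(minimum)  # (-1.0, -7.0)
```

The routine only compares finitely many candidate values; it relies on the continuity and differentiability assumptions stated in the strategy.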

The proof that this strategy works is postponed to Chapter 21.

If the function $f$ is defined on the whole real line $\mathbb{R}$ or intervals such as
$(2,\infty)$, we have to modify step (ii) above to:

(ii)' the supremum (infimum) of a function $f$, defined on an interval $U$, is
the largest (smallest) of the values of $f$ at the points where $f'$ is 0 and
the endpoint limit values $\lim_{x\to a} f(x)$ and $\lim_{x\to b} f(x)$, where $a$ and $b$
are the endpoints of the interval $U$ and the limits here are assumed to
exist.

If the limits $\lim_{x\to a} f(x)$ and $\lim_{x\to b} f(x)$ don't exist then, of course, this
method doesn't work.

The term ‘maximum’ is used when the supremum is actually attained at 
a point in the domain of the function; similarly, we speak of the ‘minimum’ 
value of a function if the infimum is actually attained in the domain of the 
function. For example, $1/x$, for $x \in (0,\infty)$, has no maxima or minima but
its supremum is $\infty$ (as $x \downarrow 0$) and its infimum is 0 (as $x \to \infty$).


20.1 Quadratics with calculus 


Let us apply the method of calculus to find the minimum value of the 
quadratic function 
$$y(x) = 3x^2 - 6x + 16.$$

The derivative is

$$y'(x) = 6x - 6$$

and this is 0 when $x = 1$. The value of $y$ here is

$$y(1) = 13.$$

There are no boundary points given, and our function is defined on the entire
real line $\mathbb{R}$. So we need to work out the endpoint limits:

$$\lim_{x\to\infty} y(x) = \infty \quad\text{and}\quad \lim_{x\to-\infty} y(x) = \infty.$$

Thus

$$\sup_{x\in\mathbb{R}} y(x) = \infty,$$

and

$$\inf_{x\in\mathbb{R}} y(x) = y(1) = 13.$$


20.2 Quadratics by algebra 


There is a way to obtain the minimum value of a quadratic function by pure 
algebra, with no use of calculus. This is by using the ancient method of 
‘completing the square’: 


$$3x^2 - 6x + 16 = 3(x^2 - 2x) + 16 = 3(x^2 - 2x + 1 - 1) + 16 = 3(x^2 - 2x + 1) - 3 + 16$$

which shows that

$$3x^2 - 6x + 16 = 3(x-1)^2 + 13.$$

The first term on the right is always $\ge 0$, with minimum value 0 when $x = 1$.
Hence $3x^2 - 6x + 16$ has minimum value

$$0 + 13 = 13,$$


and this value is attained when x = 1. 
Let us now look at the general quadratic 


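$$Ax^2 + Bx + C,$$

with $A$, $B$, $C$ being real numbers, with $A \ne 0$. Completing the square we
have

$$\begin{aligned}
Ax^2 + Bx + C &= A\left[x^2 + \frac{B}{A}x\right] + C\\
&= A\left[\left(x + \frac{B}{2A}\right)^2 - \frac{B^2}{4A^2}\right] + C.
\end{aligned}$$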
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Thus, 


$$Ax^2 + Bx + C = A\left(x + \frac{B}{2A}\right)^2 - \frac{B^2 - 4AC}{4A}. \tag{20.2}$$

If $A > 0$ then the first term on the right is always $\ge 0$, with minimum value 0
occurring at $x = -B/(2A)$; so

$$\min_{x\in\mathbb{R}}(Ax^2 + Bx + C) = -\frac{B^2 - 4AC}{4A} \quad\text{if } A > 0. \tag{20.3}$$

If $A$ is negative then the first term on the right in (20.2) is always $\le 0$ and
the largest it gets is 0, this happening when $x = -B/(2A)$; so

$$\max_{x\in\mathbb{R}}(Ax^2 + Bx + C) = -\frac{B^2 - 4AC}{4A} \quad\text{if } A < 0. \tag{20.4}$$

This is a clean and nice solution, but in practice it is faster to simply observe
that the derivative

$$(Ax^2 + Bx + C)' = 2Ax + B$$

is 0 when $x = -B/(2A)$ and this point corresponds to the maximum/minimum
value of $Ax^2 + Bx + C$.

The classic use of the method of completing the square is in obtaining 
the solutions of the quadratic equation 


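$$Ax^2 + Bx + C = 0. \tag{20.5}$$

Using the completed square form this reads

$$A\left[x + \frac{B}{2A}\right]^2 - \frac{B^2 - 4AC}{4A} = 0,$$

from which we have

$$\left[x + \frac{B}{2A}\right]^2 = \frac{B^2 - 4AC}{4A^2}.$$

Taking square roots shows that

$$x = -\frac{B}{2A} \pm \frac{\sqrt{B^2 - 4AC}}{2A},$$

where $\pm$ signifies that there are two choices. Thus the two solutions of the
quadratic equation (20.5) are

$$\alpha = \frac{-B + \sqrt{B^2 - 4AC}}{2A} \quad\text{and}\quad \beta = \frac{-B - \sqrt{B^2 - 4AC}}{2A}.$$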
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Observe that 


$$\alpha - \beta = \frac{\sqrt{B^2 - 4AC}}{A}.$$

The square of this is

$$(\alpha - \beta)^2 = \frac{B^2 - 4AC}{A^2}.$$

The quantity

$$A^2(\alpha - \beta)^2 = B^2 - 4AC \tag{20.6}$$

is called the discriminant of the quadratic

$$Ax^2 + Bx + C.$$

If the discriminant is 0 then (20.6) shows that $\alpha = \beta$. On the other hand if
the discriminant is not 0 then the roots $\alpha$ and $\beta$ are distinct.

If the discriminant is $< 0$ then looking at the expressions for $\alpha$ and $\beta$
we see that they are not real numbers (since square roots of negatives are
involved).
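
The completed-square form (20.2) and the role of the discriminant can be put into a few lines of code. This is only a sketch; the function names `complete_the_square`, `discriminant` and `real_roots` are our own.

```python
import math

def complete_the_square(A, B, C):
    """Return (h, m) with A x^2 + B x + C = A (x - h)^2 + m, as in (20.2)."""
    h = -B / (2 * A)
    m = -(B * B - 4 * A * C) / (4 * A)
    return h, m

def discriminant(A, B, C):
    return B * B - 4 * A * C

def real_roots(A, B, C):
    """Real solutions of A x^2 + B x + C = 0; empty when the discriminant is < 0."""
    d = discriminant(A, B, C)
    if d < 0:
        return ()
    r = math.sqrt(d)
    return ((-B + r) / (2 * A), (-B - r) / (2 * A))

print(complete_the_square(3, -6, 16))  # (1.0, 13.0): minimum value 13 at x = 1
print(real_roots(3, -6, 16))           # (): the discriminant is negative
print(real_roots(1, -3, 2))            # (2.0, 1.0)
```

For $A > 0$ the second entry returned by `complete_the_square` is the minimum value (20.3); for $A < 0$ it is the maximum value (20.4).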


20.3. Distance to a line 


We work out the distance of a point 


$$P(x_P, y_P)$$

from a line $L$:

$$y = mx + k.$$

This distance is, by definition, the shortest distance from $P$ to any point on
the line:

$$d(P, L) \overset{\text{def}}{=} \inf_{Q\in L} d(P, Q),$$

where

$$d(P, Q) = \text{distance between the points } P \text{ and } Q.$$

If $Q$ has coordinates $(x, y)$ then

$$d(P, Q) = \sqrt{(x - x_P)^2 + (y - y_P)^2}.$$

We have to find the minimum value of this as $(x, y)$ runs over the line $L$. We
can avoid unpleasant calculations by minimizing the distance squared:

$$d(P, Q)^2 = (x - x_P)^2 + (y - y_P)^2. \tag{20.7}$$

Clearly if we can find the minimum value of this then we can just take the
square-root to find the minimum distance. Keep in mind that in (20.7) the point $(x, y)$
is on the line $L$ and so

$$y = mx + k.$$

If we write out $d(P, Q)^2$ in terms of $x$ we have

$$d(P, Q)^2 = (x - x_P)^2 + (mx + k - y_P)^2.$$

This is clearly quadratic in $x$:

$$\begin{aligned}
d(P, Q)^2 &= x^2 - 2x_Px + x_P^2 + m^2x^2 + 2m(k - y_P)x + (k - y_P)^2\\
&= (1 + m^2)x^2 + 2[-x_P + m(k - y_P)]x + x_P^2 + (k - y_P)^2.
\end{aligned}$$

The coefficient of $x^2$ is

$$1 + m^2,$$

which is positive. From our study of quadratic functions we know then that
$d(P, Q)^2$ does attain a minimum value and at the point $Q_0$ where it attains
minimum the derivative

$$\frac{d}{dx}\,d(P, Q)^2$$

is 0.
The derivative of $d(P, Q)^2$ with respect to $x$ is:

$$\frac{d}{dx}\,d(P, Q)^2 = 2(x - x_P) + 2(y - y_P)m = 2[(x - x_P) + (y - y_P)m].$$

This is 0 if and only if $(x, y)$ is the special point

$$Q_0(x_0, y_0)$$

which satisfies

$$(x_0 - x_P) + (y_0 - y_P)m = 0. \tag{20.8}$$
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It is worth observing that this implies 


$$\frac{y_0 - y_P}{x_0 - x_P} = -\frac{1}{m}, \tag{20.9}$$

assuming that $m \ne 0$ and that $P$ isn't actually on the line $L$.
The geometric significance of (20.9) is that the line $PQ_0$ has slope $-1/m$,
and this means that

$$PQ_0 \text{ is perpendicular to the line } L.$$

Geometrically this makes perfect sense.
Now returning to (20.8), we substitute in the value of $y_0$ as $mx_0 + k$ to obtain

$$(x_0 - x_P) + (mx_0 + k - y_P)m = 0,$$

which is

$$(1 + m^2)x_0 + km - x_P - y_Pm = 0.$$

Solving this we obtain the following value for $x_0$:

$$x_0 = \frac{x_P + y_Pm - km}{1 + m^2}. \tag{20.10}$$

The corresponding value for $y_0$ is

$$\begin{aligned}
y_0 &= mx_0 + k\\
&= m\left(\frac{x_P + y_Pm - km}{1 + m^2}\right) + k\\
&= \frac{mx_P + y_Pm^2 - km^2}{1 + m^2} + \frac{k(1 + m^2)}{1 + m^2}\\
&= \frac{mx_P + m^2y_P + k}{1 + m^2}.
\end{aligned}$$

Thus

$$y_0 = \frac{mx_P + m^2y_P + k}{1 + m^2}. \tag{20.11}$$

We have thus found the point

$$Q_0(x_0, y_0)$$

on the line $L$ that is closest to the point $P$.
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We can now work out the distance between P and Qo: 


$$\begin{aligned}
d(P, Q_0)^2 &= (x_0 - x_P)^2 + (y_0 - y_P)^2\\
&= [-m(y_0 - y_P)]^2 + (y_0 - y_P)^2 \qquad\text{on using (20.8)}\\
&= m^2(y_0 - y_P)^2 + (y_0 - y_P)^2\\
&= (m^2 + 1)(y_0 - y_P)^2.
\end{aligned} \tag{20.12}$$

We need to work out $y_0 - y_P$ from (20.11):

$$\begin{aligned}
y_0 - y_P &= \frac{mx_P + m^2y_P + k}{1 + m^2} - \frac{(1 + m^2)y_P}{1 + m^2}\\
&= \frac{mx_P + m^2y_P + k - y_P - m^2y_P}{1 + m^2}\\
&= \frac{mx_P + k - y_P}{1 + m^2}.
\end{aligned}$$

Using this in the formula (20.12) for $d(P, Q_0)^2$ we have:

$$d(P, Q_0)^2 = (m^2 + 1)\left(\frac{mx_P + k - y_P}{1 + m^2}\right)^2 = \frac{(mx_P + k - y_P)^2}{1 + m^2}.$$

Taking the square root produces at last the distance of $P$ from the line $L$:

$$d(P, L) = \frac{|mx_P + k - y_P|}{\sqrt{1 + m^2}}. \tag{20.13}$$

Taking a concrete example, let us work out the distance of the point $(1, 2)$ from the line $y = 5x - 2$.
This works out to

$$\frac{|5\cdot 1 - 2 - 2|}{\sqrt{1 + 5^2}} = \frac{1}{\sqrt{26}}.$$

The absolute value in the numerator erases a piece of information. In this
example,

$$5\cdot 1 - 2 = 3 > 2$$

and this means that the point $(1, 2)$ lies below the line $y = 5x - 2$.
In fact,

$$mx_P + k - y_P$$
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measures how far ‘below’ (in the vertical y-direction) the line L the point P 
lies. If $\alpha$ is the angle between $L$ and the positive $x$-axis then

$$m = \tan\alpha$$

and so

$$\frac{1}{\sqrt{1 + m^2}} = \frac{1}{\sqrt{1 + \tan^2\alpha}} = \frac{1}{\sqrt{\sec^2\alpha}} = |\cos\alpha|.$$

Thus

$$\frac{|mx_P + k - y_P|}{\sqrt{1 + m^2}} = |(mx_P + k - y_P)\cos\alpha|.$$

If you view this geometrically it is clear that this does indeed measure the
distance between $P$ and the line $L$.
Now consider a different way of writing the equation of a line:

$$Ax + By + C = 0,$$

where at least one of $A$ and $B$ is not 0. Assume that $B \ne 0$ (if $B$ were 0 then
the line would be 'vertical', parallel to the $y$-axis). Then we can rewrite the
equation as

$$y = -\frac{A}{B}x + \frac{-C}{B}.$$

So, to switch back to our previous notation,

$$m = -\frac{A}{B}, \qquad k = \frac{-C}{B}.$$

Then the distance between $P(x_P, y_P)$ and $L$ is

$$\frac{|mx_P + k - y_P|}{\sqrt{1 + m^2}} = \frac{\left|-\frac{A}{B}x_P - \frac{C}{B} - y_P\right|}{\sqrt{1 + \frac{A^2}{B^2}}}.$$

Simplifying the algebra this produces the formula

$$d(P, L) = \frac{|Ax_P + By_P + C|}{\sqrt{A^2 + B^2}}. \tag{20.14}$$

You can check that this formula works even when $B$ is 0, for then the line
$L$ has constant $x$ value $-C/A$ and the $x$-coordinate of $P$ is $x_P$, so that the
distance is

$$|x_P - (-C/A)| = \frac{|Ax_P + C|}{|A|},$$

which matches (20.14) for $B = 0$.
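
The formulas (20.10), (20.13) and (20.14) are easy to cross-check against one another in code. The following is a sketch with function names of our own choosing; it recomputes the example of the point $(1, 2)$ and the line $y = 5x - 2$.

```python
import math

def closest_point(m, k, xp, yp):
    """The point Q0 on y = m x + k nearest to (xp, yp), using (20.10)."""
    x0 = (xp + yp * m - k * m) / (1 + m * m)
    return x0, m * x0 + k

def distance_slope_form(m, k, xp, yp):
    """Distance from (xp, yp) to the line y = m x + k, formula (20.13)."""
    return abs(m * xp + k - yp) / math.sqrt(1 + m * m)

def distance_general_form(A, B, C, xp, yp):
    """Distance from (xp, yp) to the line A x + B y + C = 0, formula (20.14)."""
    return abs(A * xp + B * yp + C) / math.hypot(A, B)

x0, y0 = closest_point(5, -2, 1, 2)
print(x0, y0)                                  # the foot of the perpendicular
print(math.hypot(x0 - 1, y0 - 2))              # direct distance from P to Q0
print(distance_slope_form(5, -2, 1, 2))        # formula (20.13)
print(distance_general_form(5, -1, -2, 1, 2))  # same line written as 5x - y - 2 = 0
```

All three distances should agree with $1/\sqrt{26}$, and the general form also handles vertical lines, where the slope form is unavailable.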
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Figure 20.1: Distance of a point P from points on a line. 


20.4 Other geometric examples 


A straight piece of wire of length L units is to be cut into two pieces, one 
of which will be bent into a square and the other a circle. What is the 
maximum and what is the minimum possible total area (enclosed by the 
square and circle) that can be enclosed in this way? 

Intuition suggests that the largest area would be obtained if we take the
entire wire and bend it into a circle. This intuition (where does it come
from?) is verified to be correct by the mathematical solution we work out.
It is not clear intuitively how to cut the wire to obtain the minimum area.

Let x units be the length of the piece that is bent into a circle. Thus if 
the radius of the circle is R then 


$$2\pi R = x,$$

and so the area enclosed by the circle is

$$\pi R^2 = \frac{x^2}{4\pi}.$$
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The remaining piece is of length 
$$L - x$$

and when this is bent to form a square, each side of the square has length

$$\frac{L - x}{4}$$

and its area is

$$\left(\frac{L - x}{4}\right)^2.$$

Thus the total area enclosed is

$$A(x) = \frac{x^2}{4\pi} + \frac{1}{16}(L - x)^2. \tag{20.15}$$

We have to find the maximum and minimum values of $A(x)$, keeping in mind
that $x$ cannot be negative or more than $L$:

$$x \in [0, L].$$


(Taking xz = 0 means we just form a large square and no circle, and taking 
x = L means we form a circle out of the full length of wire and no square at 
all.) 

The derivative of A is: 


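$$A'(x) = \frac{1}{4\pi}\,2x + \frac{1}{16}\,2(L - x)(-1) = \frac{x}{2\pi} + \frac{x - L}{8} = \frac{4x + \pi x - \pi L}{8\pi},$$

which simplifies to

$$A'(x) = \frac{(4 + \pi)x - \pi L}{8\pi}.$$

Thus

$$\text{the solution of } A'(x) = 0 \text{ is } x_0 = \frac{\pi}{\pi + 4}L.$$

The remaining length $L - x_0$ to be bent into a square is:

$$L - x_0 = L - \frac{\pi}{\pi + 4}L = \frac{4}{\pi + 4}L.$$

The total area enclosed is

$$A(x_0) = \frac{1}{4\pi}\left(\frac{\pi}{\pi + 4}L\right)^2 + \frac{1}{16}\left(\frac{4}{\pi + 4}L\right)^2. \tag{20.16}$$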
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Simplifying this we have 


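$$\begin{aligned}
A(x_0) &= \frac{1}{4\pi}\,\frac{\pi^2L^2}{(\pi + 4)^2} + \frac{1}{16}\,\frac{16L^2}{(\pi + 4)^2}\\
&= \frac{\pi}{4(\pi + 4)^2}L^2 + \frac{1}{(\pi + 4)^2}L^2\\
&= \frac{\pi}{4(\pi + 4)^2}L^2 + \frac{4}{4(\pi + 4)^2}L^2\\
&= \frac{\pi + 4}{4(\pi + 4)^2}L^2\\
&= \frac{L^2}{4(\pi + 4)}.
\end{aligned} \tag{20.17}$$

We compare this value

$$A(x_0) = \frac{L^2}{4\pi + 16}$$

with the endpoint values obtained from (20.15) with $x = 0$ and $x = L$:

$$A(0) = \frac{L^2}{16} \quad\text{and}\quad A(L) = \frac{L^2}{4\pi}.$$

Among these values $A(L)$ is the highest, having the smallest denominator
$4\pi$, and $A(x_0)$ is the least, having the largest denominator $4\pi + 16$.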

Thus the largest area is enclosed when x = L, which means we take the 
entire length of wire and bend it into a circle. 
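
The comparison of the three candidate values can be reproduced numerically. This is a small sketch; the names `total_area`, `critical_point` and `wire_extrema` are ours, and the wire length 10 is an arbitrary sample value.

```python
import math

def total_area(x, L):
    """Area of a circle of circumference x plus a square of perimeter L - x, as in (20.15)."""
    return x * x / (4 * math.pi) + (L - x) ** 2 / 16

def critical_point(L):
    """The solution x0 = pi L / (pi + 4) of A'(x) = 0."""
    return math.pi * L / (math.pi + 4)

def wire_extrema(L):
    """Return (x giving the largest area, x giving the smallest area) on [0, L]."""
    candidates = [0.0, critical_point(L), float(L)]
    best = max(candidates, key=lambda x: total_area(x, L))
    worst = min(candidates, key=lambda x: total_area(x, L))
    return best, worst

L = 10
best, worst = wire_extrema(L)
print(best, total_area(best, L))    # the whole wire goes into the circle
print(worst, total_area(worst, L))  # the minimum occurs at the critical point x0
```

The minimum value printed should match $L^2/(4(\pi + 4))$ from (20.17).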

Now consider another problem. A rectangle of sides L units by W units 
has four little squares, each having side x units, cut out of the four corners; 
the edges are now folded to form a box (with no cover). The height of the 
box is $x$ units, and the edges are $L - 2x$ units and $W - 2x$ units. What
should $x$ be to maximize the volume of the box?

The volume $V$ cubic units is given by

$$V(x) = x(L - 2x)(W - 2x) = LWx - 2(L + W)x^2 + 4x^3.$$

The value of $x$ is $\ge 0$ but cannot be more than $W/2$ (we assume $W$ is the
shorter edge, that is: $W \le L$). Thus,

$$x \in [0, W/2].$$
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The derivative V’(x) is 
$$V'(x) = LW - 4(L + W)x + 12x^2.$$

For this to be 0 we have

$$12x^2 - 4(L + W)x + LW = 0.$$

The solutions are

$$\frac{4(L + W) \pm \sqrt{16(L + W)^2 - 4\cdot 12\cdot LW}}{24} = \frac{4(L + W) \pm \sqrt{16[(L + W)^2 - 3LW]}}{24}$$

which simplify to

$$\frac{(L + W) \pm \sqrt{L^2 + W^2 - LW}}{6}.$$

We need to check if these two values fall within the interval $[0, W/2]$. Since
$L \ge W$ the numerator for the $+$ sign is

$$\begin{aligned}
(L + W) + \sqrt{L^2 + W^2 - LW} &= L + W + \sqrt{L(L - W) + W^2}\\
&\ge W + W + \sqrt{0 + W^2}\\
&= 3W
\end{aligned}$$

which makes the ratio

$$\frac{(L + W) + \sqrt{L^2 + W^2 - LW}}{6} \ge \frac{3W}{6} = \frac{W}{2},$$

falling outside the allowed range, unless we have the extreme case $L = W$
for which the ratio is $W/2$. On the other hand, if we take the $-$ sign, then
the numerator is

$$(L + W) - \sqrt{(L - W)^2 + LW}$$

and the term being subtracted is larger than $\sqrt{(L - W)^2} = L - W$, and so

$$(L + W) - \sqrt{(L - W)^2 + LW} < (L + W) - (L - W) = 2W,$$

and so then the ratio is

$$\frac{(L + W) - \sqrt{L^2 + W^2 - LW}}{6} < \frac{2W}{6} < \frac{W}{2}.$$
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Thus 


$$x_0 = \frac{(L + W) - \sqrt{L^2 + W^2 - LW}}{6}$$

is in the interior of $[0, W/2]$. Clearly this choice of $x$ must produce the
maximum value of the volume, for the value of $V(x)$ at the endpoints $x = 0$
and $x = W/2$ is 0.

Thus the maximum volume is

$$V(x_0) = x_0(L - 2x_0)(W - 2x_0).$$

After a long calculation this works out to

$$\frac{1}{54}\left[(L + W)(5LW - 2L^2 - 2W^2) + 2(L^2 + W^2 - LW)\sqrt{L^2 + W^2 - LW}\right].$$

If we start with a square, for which $L = W$, this simplifies to

$$\frac{2L^3}{27}$$

with $x_0$ being $L/6$.
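
The 'long calculation' can be spot-checked numerically. The sketch below uses our own function names and two sample rectangles (a 6 by 6 square and an 8 by 5 rectangle, chosen because the square root comes out to a whole number).

```python
import math

def box_volume(x, L, W):
    """Volume of the open box obtained by cutting squares of side x from the corners."""
    return x * (L - 2 * x) * (W - 2 * x)

def best_cut(L, W):
    """The critical point x0 lying in the interior of [0, W/2] (the minus sign root)."""
    return ((L + W) - math.sqrt(L * L + W * W - L * W)) / 6

def max_volume_formula(L, W):
    """The closed form for V(x0) stated above."""
    s = math.sqrt(L * L + W * W - L * W)
    return ((L + W) * (5 * L * W - 2 * L * L - 2 * W * W) + 2 * s ** 3) / 54

for L, W in ((6, 6), (8, 5)):
    x0 = best_cut(L, W)
    print(L, W, x0, box_volume(x0, L, W), max_volume_formula(L, W))
```

For the 6 by 6 square this gives $x_0 = 1 = L/6$ and volume $16 = 2\cdot 6^3/27$; for the 8 by 5 rectangle it gives $x_0 = 1$ and volume 18.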


Exercises on Maxima and Minima 
1. Find the maximum and minimum values of x? for x € [—1, 2]. 
2. Find the maximum and minimum values of
$x(6 - x)(3 - x)$
for $x \in [0, 2]$.


3. A wire of length 12 units is bent to form an isosceles triangle. What 
should the lengths of the sides of the triangle be to make its area 
maximum? 


4. A piece of wire is bent into a rectangle of maximum area. Show that 
this maximal area rectangle is a square. 


5. A piece of wire of length $L$ is cut into pieces of length $x$ and $L - x$
(including the possibility that $x$ is 0 or $L$), and each piece is bent into a
circle. What is the value of $x$ which would make the total area enclosed
by the pieces maximum, and what is the value of $x$ which would make
this area minimum?
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6. Here are some practice problems on straight lines and distances: 


(i) Work out the distance from $(1, 2)$ to the line $3x = 4y + 5$.
(ii) Work out the distance from $(2, -2)$ to the line $4x - 3y - 5 = 0$.

(iii) Find the point $P_0$ on the line $L$, with equation $3x + 4y - 7 = 0$,
closest to the point $P = (0, 3)$. What is the angle between $P_0P$ and
the line $L$?

(iv) Let $P_0$ be the point on the line $L$, with equation $3x + 4y - 11 = 0$,
closest to the point $P(1, 3)$. What is the slope of the line $P_0P$?

(v) Let $P_0$ be the point on the line $L$, with equation $3x + 4y - 11 = 0$,
closest to the point $P(1, 3)$. Find the equation of the line through
$P$ and $P_0$.


7. Prove the inequality

$$\frac{x^3}{3} + \frac{2}{3}k^{3/2} \ge kx, \tag{20.18}$$

for all $x, k \in (0, \infty)$. Explain when $\ge$ is $=$. [Hint: Show that, for any
fixed value $k \in (0, \infty)$, the maximum value of

$$\Phi(x) = kx - \frac{x^3}{3} \quad\text{for } x \in (0, \infty)$$

is $\frac{2}{3}k^{3/2}$. Note that $\Phi(0) = 0$ and $\lim_{x\to\infty}\Phi(x) = -\infty$; so you have to
find a point $p \in (0, \infty)$ where $\Phi'(p)$ is 0 and compare the value $\Phi(p)$
with $\Phi(0)$ and choose the larger.]


8. Prove the inequality

$$x^6 + 5k^{6/5} \ge 6kx, \tag{20.19}$$

for all $x, k \in (0, \infty)$. Now show that

$$x^6 + 5y^6 \ge 6y^5x,$$

for all $x, y \in (0, \infty)$.
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Chapter 21 


Local Extrema and Derivatives 


21.1 Local Maxima and Minima 


Consider a function $f$ on a set $S \subset \mathbb{R}$. Suppose $u \in S$ is such that the value
of $f$ at $u$ is maximum:

$$f(u) = \sup_{x\in S} f(x).$$

Assume that this point $u$ actually lies in the interior of $S$:

$$u \in S^\circ.$$

Then, in any neighborhood $U$ of $u$, there are points of $S$ to the right of $u$
that are in $U$ and there are points of $S$ to the left of $u$ that are also in $U$.
Then for any $x \in U$ to the right of $u$ we have

$$\frac{f(x) - f(u)}{x - u} \le 0$$

because the numerator is $\le 0$ and the denominator is $> 0$.
On the other hand, for $x \in U$ to the left of $u$ we have

$$\frac{f(x) - f(u)}{x - u} \ge 0$$

because the numerator is $\le 0$ and the denominator is $< 0$. Thus, on any
neighborhood $U$ of $u$, the sup of the ratio $\frac{f(x)-f(u)}{x-u}$ is $\ge 0$ and its inf is $\le 0$:

$$\inf_{x\in U,\,x\ne u}\frac{f(x) - f(u)}{x - u} \le 0 \le \sup_{x\in U,\,x\ne u}\frac{f(x) - f(u)}{x - u}.$$


But this just means that the line of slope 0, through the point P = (u, f(u)) 
on the graph of f, satisfies the condition for quasi-tangents. 

A very similar argument works for a point $b$ where $f$ is a minimum.

In fact these arguments easily establish: 


Proposition 21.1.1 Suppose $f$ is a function on a set $S \subset \mathbb{R}$, and $b \in S$ is
a point in the interior of $S$. Let $U$ be a neighborhood of $b$ contained in $S$ and
suppose

$$f(b) \le f(x) \quad\text{for all } x \in U.$$

If, moreover, $f'(b)$ exists then

$$f'(b) = 0.$$

Suppose $u \in S$ is such that there is a neighborhood $U$ of $u$ with $U \subset S$
and

$$f(u) \ge f(x) \quad\text{for all } x \in U.$$

If, moreover, $f'(u)$ exists then

$$f'(u) = 0.$$

For a function $f$ defined on a set $S \subset \mathbb{R}$, a point $b \in S$ is said to be a
local minimum if there is a neighborhood $U$ of $b$ on which the value of $f$ at
$b$ is $\le$ all other values:

$$f(b) \le f(x) \quad\text{for all } x \in U \cap S.$$

A point $u$ is said to be a local maximum of $f$ if the value $f(u)$ is $\ge$ all other
values over $U$:

$$f(u) \ge f(x) \quad\text{for all } x \in U \cap S.$$

With this terminology, Proposition 21.1.1 says that if a function has a
local minimum or a local maximum in the interior of its domain of definition,
and if the graph has a tangent line at that point, then this tangent line
is horizontal, that is, it has 0 slope.

Recalling the term ‘quasi-tangent’ we introduced back at the end of sec- 
tion we see a sharper form of the preceding proposition: 


If $f$ is a function, defined on a set $S$, that has a local minimum
or a local maximum at a point $p$ in the interior of $S$, then the line
through $p$ of zero slope is a quasi-tangent to the graph $y = f(x)$
at the point $(p, f(p))$.

Review Exercises

1. For the set

\[ S = [-\infty, -1) \cup (1, 2] \cup \{6, 8\} \cup [9, \infty] \]

write down

(i) an interior point
(ii) a limit point
(iii) a boundary point
(iv) an isolated point
(v) the interior $S^\circ$ =
(vi) the boundary $\partial S$ =


2. Answer and explain briefly:

(i) If $4 < \sup T$, is 4 an upper bound of $T$?
(ii) In (i), is there a point of $T$ that is $> 4$?
(iii) If $\inf T < 3$, is 3 a lower bound of $T$?
(iv) In (iii), is there a point of $T$ that is $< 3$?


3. Answer the following concerning limits, with brief explanations: 


(i) If $\lim_{x \to 1} F(x) = 2$ does it follow that $F(1) = 2$?
(ii) If $g$ is continuous at 3 is $g$ differentiable at 3?
(iii) If $g$ is differentiable at 5 is $g$ continuous at 5?
(iv) If $h'(5) = 4$ and $h(5) = 8$ then $\lim_{x \to 5} h(x) =$
(v) If $H'(2) = 5$ and $H(2) = 3$ then $\lim_{x \to 2}$ [expression illegible in source] $=$
(vi) If $G'(5) = 1$ and $G(5) = 6$ then $\lim_{x \to 5}$ [expression illegible in source] $=$
(vii) [illegible in source]
(viii) $\lim_{w \to \pi/3} \frac{\sin w - \sin(\pi/3)}{w - \pi/3} =$


(ix) If G’(3) = 4 then 


4. Work out the following derivatives: 


5. aa oe 
OO ar re 


(ii) d[(1+,/y) tan y] 


(v) eee) 


5. Using the definition of the derivative, show that 


d(1,/Z) 1 


dx Pig tem 


Chapter 22 


Mean Value Theorem 


In this chapter we explore a very powerful result in calculus: the mean value 
theorem. This result shows that for any differentiable function f, the slope 
of a secant line PQ is actually equal to the slope of a suitable tangent line 
to the graph $y = f(x)$.


Figure 22.1: A tangent line parallel to a secant. 


22.1 Rolle’s Theorem 


The following is a version of Rolle’s theorem, which is a key step towards 
proving the mean value theorem: 


Theorem 22.1.1 Let $f$ be a function, continuous on the interval $[a, b]$, where
$a, b \in \mathbb{R}$ with $a < b$, and suppose

\[ f(a) = f(b). \]

Suppose also that the derivative $f'(x)$ exists for all $x \in (a,b)$. Then there is
a point $p$ on the graph of $f$ over $(a,b)$ where there is a tangent line of zero
slope:

\[ f'(p) = 0 \quad \text{for some } p \in (a,b). \]


Proof. Since $f$ is continuous we know that there is a point in $[a,b]$ where it
reaches its maximum value and a point in $[a, b]$ where it reaches its minimum
value.

Let us first see that at least one of these values must occur at a point $p$
in $(a,b)$, the interior of the interval. If both the maximum and the minimum
of $f$ were to occur at the end points $a$ and $b$, then, since $f(a) = f(b)$, the
function $f$ must be constant, say with value $K$; picking any $p \in (a,b)$ we
have $f(p) = K$, which is both the maximum and the minimum of $f$. Thus
in all cases there is a $p \in (a,b)$ such that $f$ attains either its maximum or its
minimum value at $p$. Then, by Proposition 21.1.1, $f'(p) = 0$. QED

There is a small sharpening of Rolle’s theorem we could note, just to see
how proofs can be tweaked to sharpen results. Recall that if $f$ is continuous
on $[a,b]$ and attains either a maximum or a minimum value $f(p)$ at an interior point
$p \in (a,b)$, then there is a quasi-tangent line at $(p, f(p))$ to the graph $y = f(x)$
that is flat. Thus, we could drop the requirement in Rolle’s theorem that
$f$ is differentiable and conclude that there is a point $p \in (a,b)$ where the
graph $y = f(x)$ has a flat quasi-tangent line.


22.2 Mean Value Theorem 
The following is the enormously useful Mean Value Theorem: 


Theorem 22.2.1 Let $f$ be a function, continuous on the interval $[a, b]$, where
$a, b \in \mathbb{R}$ with $a < b$. Suppose also that the derivative $f'(x)$ exists for all
$x \in (a,b)$. Then there is a point on the graph of $f$ over $(a,b)$ where there is
a tangent line that has slope equal to

\[ \frac{f(b) - f(a)}{b - a}. \]

Thus,

\[ f'(p) = \frac{f(b) - f(a)}{b - a} \quad \text{for some } p \in (a,b). \tag{22.1} \]

Proof. Consider the secant line passing through the points $A = (a, f(a))$ and
$B = (b, f(b))$. The slope of this line is

\[ M = \frac{f(b) - f(a)}{b - a}. \tag{22.2} \]

Its equation is

\[ y - f(a) = M(x - a). \]

Consider now how high $f$ rises above this line:

\[ H(x) = f(x) - [M(x - a) + f(a)] \quad \text{for all } x \in [a, b]. \tag{22.3} \]


Figure 22.2: The height H of the graph of f above a secant AB. (The line AB has equation $y = M(x - a) + f(a)$.)


This function is continuous on $[a,b]$, being the sum of two continuous functions,
and it is differentiable on $(a,b)$ because $f$ is. Moreover,

\[ H(a) = H(b) = 0. \]

Hence, by Rolle’s theorem (Theorem 22.1.1), there is a point $p \in (a,b)$ where
the tangent line to the graph of $H$ is flat:

\[ H'(p) = 0. \]

From the expression for $H(x)$ given in (22.3) we have

\[ H'(x) = f'(x) - M. \]

(This makes geometric sense: the slope of $H$ is the slope of $f$ minus the slope
of the line $M(x - a) + f(a)$.) So the relation $H'(p) = 0$ means

\[ f'(p) - M = 0, \]

which means $f'(p) = M$. QED
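The proof can be tried out numerically. The following is a minimal sketch, not part of the original notes: the helper name `mean_value_point` and the choice of bisection are our own assumptions, and the method only works when $H'(x) = f'(x) - M$ changes sign between the end points, which holds for the example $f(x) = x^3$ on $[0, 2]$ but not for every differentiable function.

```python
def mean_value_point(f, df, a, b, tol=1e-12):
    """Locate p in (a, b) with df(p) equal to the secant slope, by bisection.

    Hypothetical helper: it assumes H'(x) = df(x) - M has opposite signs
    at a and b, which is NOT guaranteed by the mean value theorem itself.
    """
    M = (f(b) - f(a)) / (b - a)        # slope of the secant AB, as in (22.2)

    def h_prime(x):
        return df(x) - M               # H'(x) = f'(x) - M

    lo, hi = a, b
    if h_prime(lo) * h_prime(hi) > 0:
        raise ValueError("H' does not change sign at the end points")
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if h_prime(lo) * h_prime(mid) <= 0:
            hi = mid                   # a zero of H' lies in [lo, mid]
        else:
            lo = mid                   # a zero of H' lies in [mid, hi]
    return (lo + hi) / 2


def cube(x):
    return x ** 3


def cube_derivative(x):
    return 3 * x ** 2


# f(x) = x^3 on [0, 2]: the secant slope is (8 - 0) / (2 - 0) = 4.
p = mean_value_point(cube, cube_derivative, 0.0, 2.0)
print(p, cube_derivative(p))
```

For this example the point found is the solution of $3p^2 = 4$ in $(0, 2)$; other functions may have several such points, and bisection returns only one of them.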


22.3. Rolle’s theorem on R* 


The argument used to prove Rolle’s theorem (Theorem 22.1.1) extends directly
to cover the case of functions defined on subsets of $\mathbb{R}^*$:

Theorem 22.3.1 Let $F : [a,b] \to \mathbb{R}^*$ be a continuous function, where $a, b \in \mathbb{R}^*$
with $a < b$, with $F(x) \in \mathbb{R}$ for all $x \in (a,b)$, and suppose

\[ F(a) = F(b). \]

Suppose also that the derivative $F'(x)$ exists for all $x \in (a,b)$. Then

\[ F'(p) = 0 \quad \text{for some } p \in (a,b). \]

We will use this result in establishing one case of l’Hospital’s rule
(Proposition 28.2.2).


Chapter 23 


The Sign of the Derivative 


In this chapter we harness the power of the mean value theorem and a
very precise understanding of the notion of limit (as explored earlier)
to study the relationship between the sign (positive/negative/zero)
of the derivative $f'$ of a function and the nature of the function $f$, whether
it is increasing, decreasing or constant.

Recall that for a function $f$ defined on a set $S \subset \mathbb{R}$, and a point $p \in S$, if
$U$ is a neighborhood of $p$ then part of $U$ might not be inside $S$. So to work
with values $f(x)$ for $x$ in the neighborhood $U$ we must focus on $x \in U \cap S$,
which guarantees that $x$ does lie in the domain $S$ of $f$.


23.1 Positive derivative and increasing nature 


Intuitively it is clear that a function is increasing wherever its slope is > 0,
and it is decreasing wherever its slope is < 0. In this section we make this 
idea precise. 

The simplest observation on slopes and derivatives is that if $f$ is an increasing
function on an interval then its slope is $\ge 0$:

Proposition 23.1.1 Let $f$ be a function on a set $S \subset \mathbb{R}$.
If $f$ is increasing on $S$, that is if $f(s) \le f(t)$ for all $s, t \in S$ with $s < t$,
then $f'(p) \ge 0$ for all $p \in S$ where $f'(p)$ exists.
If $f$ is decreasing on $S$, that is if $f(s) \ge f(t)$ for all $s, t \in S$ with $s < t$,
then $f'(p) \le 0$ for all $p \in S$ where $f'(p)$ exists.

Proof. This follows directly from the definition of the derivative:

\[ f'(p) = \lim_{x \to p} \frac{f(x) - f(p)}{x - p}. \]

If $f$ is increasing then $f(x) \ge f(p)$ when $x > p$ (thus $x - p > 0$) in $S$ and
$f(x) \le f(p)$ when $x < p$ (thus $x - p < 0$) in $S$. Hence the ratio $\frac{f(x) - f(p)}{x - p}$ is
$\ge 0$, and so the limit $f'(p)$ is also $\ge 0$.

If $f$ is decreasing then

\[ \frac{f(x) - f(p)}{x - p} \le 0 \]

both when $x > p$ and when $x < p$, with $x \in S$. Hence in this case $f'(p) \le 0$.
QED
The following is a much sharper result going in the other direction: 


Proposition 23.1.2 Let $f$ be a function on a set $S \subset \mathbb{R}$, and $p$ a point in
$S$ where $f'(p)$ exists and is positive, that is

\[ f'(p) > 0. \]

Then there is a neighborhood $U$ of $p$ such that $f(x) > f(p)$ for $x \in U \cap S$
to the right of $p$ and $f(x) < f(p)$ for $x \in U \cap S$ to the left of $p$:

\[ \begin{aligned} f(x) &> f(p) \quad \text{for all } x \in U \cap S \text{ for which } x > p, \\ f(x) &< f(p) \quad \text{for all } x \in U \cap S \text{ for which } x < p. \end{aligned} \tag{23.1} \]

Thus, roughly put, if the slope of $y = f(x)$ is $> 0$ at a point $p$ then just
to the right of $p$ the values of $f$ are higher than $f(p)$ and just to the left of
$p$ the values of $f$ are lower than $f(p)$.
Proof. Recall the definition of $f'(p)$:

\[ f'(p) = \lim_{x \to p} \frac{f(x) - f(p)}{x - p}. \]

If this is $> 0$ then the ratio

\[ \frac{f(x) - f(p)}{x - p} \]

is also $> 0$ when $x$ is near $p$, but $\ne p$ (see Proposition 10.1.1 for this). Put
more precisely, this means that there is a neighborhood $U$ of $p$ such that

\[ \frac{f(x) - f(p)}{x - p} > 0 \quad \text{for all } x \in U \cap S \text{ with } x \ne p. \]

If we take an $x \in U \cap S$ to the right of $p$, we have $x - p > 0$ and so

\[ f(x) - f(p) = (x - p)\, \frac{f(x) - f(p)}{x - p} > 0. \]

This means $f(x) > f(p)$ for such values of $x$.
On the other hand, if $x \in U \cap S$ is to the left of $p$, we have $x - p < 0$ and so

\[ f(x) - f(p) = (x - p)\, \frac{f(x) - f(p)}{x - p} < 0. \]

This means $f(x) < f(p)$ for such values of $x$. QED

Figure 23.1: Positive slope and increasing function.
Using this we can step up to another result going in the converse direction
to Proposition 23.1.1:


Proposition 23.1.3 Suppose $f$ is a continuous function defined on an interval
$U$, and suppose $f'(p)$ exists and is positive (this means $> 0$) for all $p$
in the interior of $U$. Then $f$ is strictly increasing on $U$ in the sense that:

\[ f(x_1) < f(x_2) \quad \text{for all } x_1, x_2 \in U \text{ with } x_1 < x_2. \]

If $f'$ is assumed only to be $\ge 0$ then the conclusion is $f(x_1) \le f(x_2)$.


Proof. Consider $x_1, x_2 \in U$ with $x_1 < x_2$. By the mean value theorem there
is a point $c \in (x_1, x_2)$ where the derivative $f'(c)$ is given by

\[ f'(c) = \frac{f(x_2) - f(x_1)}{x_2 - x_1}. \]

We are given that $f'(c) > 0$ and we know that the denominator $x_2 - x_1$ is
positive; hence so is the numerator:

\[ f(x_2) - f(x_1) = (x_2 - x_1)\, f'(c) > 0, \]

which shows that

\[ f(x_2) > f(x_1). \]

If we assume only that $f' \ge 0$ then the same argument shows that $f(x_2) \ge f(x_1)$. QED


Here is a slight but useful sharpening of the preceding result: 


Proposition 23.1.4 Suppose $f$ is a continuous function defined on an interval
$U$, and suppose $f'(p)$ exists and is $\ge 0$ for all $p$ in the interior of $U$
and $f'(p)$ is 0 at most at finitely many $p \in U$. Then $f$ is strictly increasing
on $U$. If $f' \le 0$ in the interior of $U$ and $f'(p)$ is 0 at most at finitely many points
$p \in U$ then $f$ is strictly decreasing on $U$.

Proof. Assume $f' \ge 0$ in the interior of $U$ and $f'$ takes the value 0 at finitely
many points. By Proposition 23.1.3, $f$ is an increasing function on $U$ in the
sense that $f(a) \le f(b)$ for all $a, b \in U$ with $a < b$. Hence if $f(s) = f(t)$ for
some $s, t \in U$ with $s < t$ then $f$ would be constant on the interval $[s,t]$, which
would imply that $f'$ is 0 on this entire interval, contradicting the assumption
on $f'$. This proves the result for $f' \ge 0$. For $f' \le 0$ the argument is exactly
similar (or observe that it follows from the case $f' \ge 0$ by flipping the sign
of $f$ to $-f$). QED
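As an illustration, the function $x^3$ has derivative $3x^2$, which is $\ge 0$ everywhere and equal to 0 only at $x = 0$, so Proposition 23.1.4 says it is strictly increasing. The sketch below checks this on a finite grid of sample points; the function name `is_strictly_increasing_on_samples` is hypothetical, and a check on samples is of course only a sanity test, not a proof.

```python
def is_strictly_increasing_on_samples(f, xs):
    """Return True if f(x1) < f(x2) for each pair of consecutive samples.

    xs is assumed to be sorted in strictly increasing order.
    """
    values = [f(x) for x in xs]
    return all(u < v for u, v in zip(values, values[1:]))


def cube(x):
    return x ** 3                  # derivative 3x^2 vanishes only at x = 0


def flat(x):
    return 1.0                     # derivative is 0 everywhere: not strictly increasing


xs = [k / 100 for k in range(-200, 201)]   # sample points in [-2, 2]
print(is_strictly_increasing_on_samples(cube, xs))
print(is_strictly_increasing_on_samples(flat, xs))
```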
23.2 Negative derivative and decreasing nature


The results of the preceding section can be run analogously for functions 
with downward pointing slope. 


Proposition 23.2.1 Let $f$ be a function on a set $S \subset \mathbb{R}$, and $p$ a point in
$S$ where $f'(p)$ exists and is negative, that is

\[ f'(p) < 0. \]

Then there is a neighborhood $U$ of $p$ such that $f(x) < f(p)$ for $x \in U \cap S$
to the right of $p$ and $f(x) > f(p)$ for $x \in U \cap S$ to the left of $p$:

\[ \begin{aligned} f(x) &< f(p) \quad \text{for all } x \in U \cap S \text{ for which } x > p, \\ f(x) &> f(p) \quad \text{for all } x \in U \cap S \text{ for which } x < p. \end{aligned} \tag{23.2} \]


If f slopes downward along an interval then it is decreasing: 


Proposition 23.2.2 If $f$ is defined on an interval $[a,b]$, where $a, b \in \mathbb{R}$ with
$a < b$, and if $f'(p)$ exists and is negative, that is $< 0$, for all $p \in [a, b]$ then $f$
is strictly decreasing on $[a,b]$ in the sense that:

\[ f(x_1) > f(x_2) \quad \text{for all } x_1, x_2 \in [a,b] \text{ with } x_1 < x_2. \]

If $f'$ is assumed only to be $\le 0$ on $[a,b]$ then the conclusion is $f(x_1) \ge f(x_2)$.


23.3 Zero slope and constant functions 


Clearly a constant function has zero slope: the derivative of a constant func- 
tion is 0 wherever defined. One can run this also in the converse direction, 
but with just a bit of care. 

Consider a function $G$ that is defined on a domain consisting of two
separated intervals, on each of which it is constant, with two different
constant values $c_1 \ne c_2$:

\[ G(x) = \begin{cases} c_1 & \text{if } x \in (0,1); \\ c_2 & \text{if } x \in (8,9). \end{cases} \]

Then clearly

\[ G'(p) = 0 \quad \text{for all } p \text{ in the domain of } G, \]

and yet $G$ is, of course, not constant. On the other hand it is also clear that
$G$ really is constant, separately on each interval on which it is defined.


Proposition 23.3.1 Suppose $f$ is a function on an interval $[a,b]$, where
$a, b \in \mathbb{R}$ with $a < b$, and $f'(p) = 0$ for all $p \in [a,b]$. Then $f$ is constant on
$[a,b]$. If $f$ is defined on an open interval $(a,b)$ and $f'$ is 0 on $(a,b)$ then $f$ is
constant on $(a,b)$.

One can tinker with this as usual. It is not necessary (for the case of $[a, b]$) to
assume that $f'(a)$ and $f'(b)$ exist; it suffices to assume that $f$ is continuous
at $a$ and at $b$.

Proof. Consider any $x_1, x_2 \in [a,b]$ with $x_1 < x_2$. Then by the mean value
theorem

\[ \frac{f(x_2) - f(x_1)}{x_2 - x_1} = f'(c) \]

for some $c \in (x_1, x_2)$. So if $f'$ is 0 everywhere it follows that

\[ \frac{f(x_2) - f(x_1)}{x_2 - x_1} = 0, \]

and so

\[ f(x_2) - f(x_1) = 0, \]

which means $f(x_1) = f(x_2)$. Thus the values of $f$ at any two different points
are equal; that is, $f$ is constant. QED


Chapter 24 


Differentiating Inverse 
Functions 


Often an equation of the form

\[ y = f(x) \]

can be solved for $x$:

\[ x = f^{-1}(y), \]

and $f^{-1}$ is called an inverse to the function $f$. If $f$ is differentiable then
formal common sense suggests that the derivative of the inverse function
should be

\[ \frac{dx}{dy} = \frac{1}{dy/dx} = \frac{1}{f'(x)}. \]

For example, for

\[ y = x^2 \]

we have an inverse function given by the square root function

\[ x = \sqrt{y} \]

and its derivative should be

\[ \frac{dx}{dy} = \frac{1}{dy/dx} = \frac{1}{2x} = \frac{1}{2\sqrt{y}}, \]

which certainly is the derivative of $\sqrt{y}$ with respect to $y$. Of course, we need
to avoid the points where $dy/dx$ is 0 (or undefined).

Note that we have been referring to ‘an’ inverse function. For $y = x^2$
another choice of inverse is given by the other ‘branch’ of square root:

\[ x = -\sqrt{y}. \]

Things could be really made messy by choosing an inverse function that
switches wildly back and forth between the branches $\sqrt{y}$ and $-\sqrt{y}$. This
just means that we need to exercise some care about choosing a specific
well-behaved branch as an inverse function.


24.1 Inverses and Derivatives 


Suppose $f$ is a function on an interval $U$ such that $f'(x)$ exists for every
$x \in U$ and is positive, that is $f'(x) > 0$ (alternatively we could assume that
$f' < 0$ everywhere on $U$). Let $V$ denote the range of $f$:

\[ V = f(U) = \{ f(x) : x \in U \}. \]

Since $f' > 0$ on $U$, $f$ is a strictly increasing function and so it has a unique
inverse function

\[ f^{-1} : V \to \mathbb{R}, \]

specified by the requirement that

\[ f(f^{-1}(y)) = y \quad \text{for all } y \in V. \]

Alternatively,

\[ f^{-1}(f(x)) = x \quad \text{for all } x \in U. \]


Proposition 24.1.1 Suppose $f$ is a function defined on an interval $U$, such
that $f'(x)$ exists and is $\ge 0$ for all $x \in U$, being equal to 0 at most at finitely
many points. Then $(f^{-1})'(y)$ exists for all $y \in V$, the range of $f$, and

\[ (f^{-1})'(y) = \frac{1}{f'(x)}, \tag{24.1} \]

where $x = f^{-1}(y)$; in (24.1) we take the right side $1/f'(x)$ to be 0 in case
$f'(x)$ is $\infty$, and $\infty$ in case $f'(x)$ is 0.

Note that, as a consequence, if $f$ is differentiable (finite derivative) and has
derivative either positive on all of $U$ or negative on all of $U$ then $f^{-1}$ is also
differentiable. As usual, there is a corresponding result if $f'$ is $\le 0$, being
$< 0$ at all but finitely many points.

Proof. Suppose $f' \ge 0$ on $U$ and is actually $> 0$ except possibly at finitely
many points. Then by Proposition 23.1.4, $f$ is strictly increasing.

By an earlier result, the range $V$ of $f$ is an interval and the inverse
function $f^{-1}$ is defined on $V$. Let $p \in U$ and $q = f(p) \in V$. Then

\[ (f^{-1})'(q) = \lim_{y \to q} \frac{f^{-1}(y) - f^{-1}(q)}{y - q}. \tag{24.2} \]

Writing $x$ for $f^{-1}(y)$ and $p$ for $f^{-1}(q)$ we see that the difference ratio on the
right here is

\[ \frac{x - p}{f(x) - f(p)}. \]

Let us denote this by $D(x)$:

\[ D(x) = \frac{x - p}{f(x) - f(p)} \]

for $x \in U$ and $x \ne p$ (note that then $f(x) \ne f(p)$, being either $>$ or $<$
$f(p)$ depending on whether $x > p$ or $x < p$). Thus the limit above reads:

\[ (f^{-1})'(q) = \lim_{y \to q} D(f^{-1}(y)). \tag{24.3} \]

The definition of $f'(p)$ implies that

\[ \lim_{x \to p} D(x) = \lim_{x \to p} \frac{1}{\frac{f(x) - f(p)}{x - p}} = \frac{1}{f'(p)}, \tag{24.4} \]

this being taken to be 0 if $f'(p) = \infty$ and to be $\infty$ if $f'(p)$ is 0 (these extreme
cases require some care).

Now since $f^{-1}$ is continuous (by Proposition 11.3.2) we have:

\[ f^{-1}(y) \to p \quad \text{as } y \to q, \]

and we also know that $f^{-1}(y) \ne p$ when $y \ne q$. Then by an earlier result on limits of compositions
we have

\[ \lim_{y \to q} D(f^{-1}(y)) = \lim_{x \to p} D(x). \tag{24.5} \]

Combining this with the expression for $(f^{-1})'(q)$ in (24.3) and the limit value
in (24.4) we have:

\[ (f^{-1})'(q) = \frac{1}{f'(p)}. \tag{24.6} \]

This completes the proof. QED
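The formula $(f^{-1})'(q) = 1/f'(p)$ can be checked numerically. The sketch below is only an illustration under our own assumptions: the example $f(x) = x^3 + x$ (whose derivative $3x^2 + 1$ is positive everywhere), the helper names, the bisection solver for the inverse, and the step size are all choices made here, not taken from the notes.

```python
def f(x):
    return x ** 3 + x              # f'(x) = 3x^2 + 1 > 0, so f is strictly increasing


def df(x):
    return 3 * x ** 2 + 1


def f_inverse(y, lo=-10.0, hi=10.0, tol=1e-13):
    """Solve f(x) = y for x by bisection; assumes f(lo) <= y <= f(hi)."""
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if f(mid) < y:
            lo = mid               # the solution lies to the right of mid
        else:
            hi = mid               # the solution lies at or to the left of mid
    return (lo + hi) / 2


def inverse_derivative_numeric(q, h=1e-5):
    """Central difference quotient of f_inverse at q."""
    return (f_inverse(q + h) - f_inverse(q - h)) / (2 * h)


p = 1.0
q = f(p)                           # q = f(1) = 2
numeric = inverse_derivative_numeric(q)
formula = 1 / df(p)                # 1 / f'(1) = 1/4
print(numeric, formula)
```

The two printed numbers should agree to several decimal places; the agreement is limited by the tolerance of the bisection and by the step size `h`.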


Chapter 25 


Analyzing local extrema with 
higher derivatives 


25.1 Local extrema and slope behavior 


Suppose a function f is defined on a neighborhood of a point p € R and f has 
a local minimum or maximum at p. As we have seen, if f’(p) exists it must 
be 0. In this section we explore ways to tell whether p is a local minimum or
a local maximum by observing the behavior of the slope f’. 

The basic idea is simple: if a continuous graph is sloping downward just 
to the left of a point p and sloping upward just to the right then it must have 
a local minimum at p. This is formalized in the following result. 


Proposition 25.1.1 Suppose $f$ is defined and continuous on a neighborhood
$U$ of $p \in \mathbb{R}$, and the derivative $f'$ is $\ge 0$ to the right of $p$ and $\le 0$ to the
left of $p$; more precisely, suppose $f$ is differentiable on $U$ except possibly at
$p$, and $f'(x) \ge 0$ for $x \in U$ with $x > p$ and $f'(x) \le 0$ for $x \in U$ with $x < p$.
Then $p$ is a local minimum for $f$.
As an example, consider the function

\[ g(x) = x^4 - 4x^3 + 2. \]

Its derivative is

\[ g'(x) = 4x^3 - 12x^2 = 4x^2(x - 3), \]

and this is 0 at $x = 0$ and at $x = 3$:

\[ g'(0) = 0 \quad \text{and} \quad g'(3) = 0. \]

Let us look at the point $x = 3$. We see that

\[ 4x^2(x - 3) \;\begin{cases} \le 0 & \text{if } x < 3; \\ \ge 0 & \text{if } x > 3. \end{cases} \]

Then Proposition 25.1.1 implies that $x = 3$ gives a local minimum for $g$.
What about the point $x = 0$? Observe that on the neighborhood $(-3,3)$
of 0 we have

\[ 4x^2(x - 3) \le 0 \quad \text{for all } x \in (-3,3). \]

Thus $g(x)$ continues to decrease in value as $x$ passes from the left of 0 to the
right of 0, and 0 is not a local maximum or minimum.
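The sign check in this example can be mimicked in a few lines of code. This is a rough sketch with names of our own choosing (`classify_by_slope` is hypothetical); it samples the derivative at just one point on each side of $p$, so it is a heuristic illustration of Proposition 25.1.1 rather than a rigorous test.

```python
def dg(x):
    return 4 * x ** 3 - 12 * x ** 2     # g'(x) = 4x^2 (x - 3)


def classify_by_slope(df, p, h=1e-3):
    """Classify a critical point p from the sign of df just left and right of p.

    Heuristic only: it looks at the two points p - h and p + h.
    """
    left, right = df(p - h), df(p + h)
    if left < 0 and right > 0:
        return "local minimum"          # sloping down, then up
    if left > 0 and right < 0:
        return "local maximum"          # sloping up, then down
    return "no sign change"


print(classify_by_slope(dg, 3.0))
print(classify_by_slope(dg, 0.0))
```

At $x = 3$ the slope changes from negative to positive, while at $x = 0$ it is negative on both sides, matching the discussion above.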
Proof of Proposition 25.1.1. Consider any $x \in U$ to the right of $p$, that is
$x > p$; the mean value theorem (Theorem 22.2.1) says that

\[ f(x) - f(p) = (x - p)\, f'(c) \quad \text{for some } c \in (p, x). \]

If $f'$ is $\ge 0$ on $U$ to the right of $p$ then $f'(c) \ge 0$, and so we see that

\[ f(x) - f(p) \ge 0 \quad \text{for } x \in U \text{ with } x > p. \]

Thus

\[ f(x) \ge f(p) \quad \text{for } x \in U \text{ with } x > p. \]

On the other hand, taking $x < p$ but inside $U$ we have, again by the mean
value theorem,

\[ f(x) - f(p) = (x - p)\, f'(c) \quad \text{for some } c \in (x, p), \]

but observe now that $x - p < 0$ and $f'(c)$ is given to be $\le 0$ (for $c$ is to the
left of $p$); hence

\[ f(x) - f(p) \ge 0 \quad \text{for } x \in U \text{ with } x < p. \]

Thus,

\[ f(x) \ge f(p) \quad \text{for } x \in U \text{ with } x < p. \]

We have shown that $f(x)$ is $\ge f(p)$ for all $x \in U$, both those to the left of
$p$ and those to the right of $p$. This means that $f$ has a local minimum at $p$.
QED

By a closely similar argument we obtain the analogous result for local 
maxima: 


Proposition 25.1.2 Suppose $f$ is defined and continuous on a neighborhood
$U$ of $p \in \mathbb{R}$, and the derivative $f'$ is $\le 0$ to the right of $p$ and $\ge 0$ to the
left of $p$; more precisely, suppose $f$ is differentiable on $U$ except possibly at
$p$, and $f'(x) \le 0$ for $x \in U$ with $x > p$ and $f'(x) \ge 0$ for $x \in U$ with $x < p$.
Then $p$ is a local maximum for $f$.

25.2 The second derivative test

There is often a faster way to check whether a point $p$ where $f'(p) = 0$
is a local minimum or maximum: this is by simply computing the second
derivative $f''(p)$. If this is positive we have a local minimum at $p$, while if it
is negative then we have a local maximum at $p$.

For example,

\[ y = \sin(x^2) \]

has derivative

\[ \frac{d \sin(x^2)}{dx} = 2x \cos(x^2), \]

which is 0 when $x = 0$, and second derivative

\[ \frac{d^2 \sin(x^2)}{dx^2} = 2\cos(x^2) - 4x^2 \sin(x^2), \]

whose value at $x = 0$ is 2; then, without even knowing anything about the
graph, we can say that $\sin(x^2)$ has a local minimum at $x = 0$. (You should,
however, see that since $\sin\theta \approx \theta$ near $\theta = 0$, the graph $y = \sin(x^2)$ looks
about like that of $y = x^2$, near $x = 0$, and since this clearly has a local
minimum at $x = 0$, it is a good guess that $\sin(x^2)$ has a local minimum at
$x = 0$.)

Here is a formal statement: 


Proposition 25.2.1 Let f be a function defined and differentiable on a 
neighborhood of p € R. Assume also that the second derivative f"(p) ex- 
ists. If 


f'(p)=0 and f"(p)>0 


then f has a local minimum at p. If 


f'(p)=0 and f"(p) <0 


then f has a local maximum at p. 


For the proof take a look back first at Propositions 23.1.2 and ??.
Proof. We are given that $f$ is differentiable on a neighborhood $W$ of $p$.
Thus the derivative $f'(x)$ exists and is finite at all $x \in W$. Moreover, we are
also given that $f'$ itself has a derivative $(f')'(p) = f''(p)$ at the point $p$.

Suppose $f''(p) > 0$. Thus, the derivative of $f'$ at $p$ is $> 0$. Then by
Proposition 23.1.2, applied to $f'$, the value $f'(x)$ for $x$ immediately to the left of $p$ is less
than the value $f'(p)$, whereas the value $f'(x)$ for $x$ immediately to the right
of $p$ is greater than the value $f'(p)$. More precisely, there is a neighborhood
$U$ of $p$ such that

\[ \begin{aligned} f'(x) &> f'(p) \quad \text{for } x > p \text{ and } x \in U; \\ f'(x) &< f'(p) \quad \text{for } x < p \text{ and } x \in U. \end{aligned} \]

Since we are given that $f'(p)$ is 0 this means

\[ \begin{aligned} f'(x) &> 0 \quad \text{for } x > p \text{ and } x \in U; \\ f'(x) &< 0 \quad \text{for } x < p \text{ and } x \in U. \end{aligned} \]

Then by Proposition 25.1.1, $f$ has a local minimum at $p$.

The argument for $f''(p) < 0$ is very similar, using Proposition ?? to see
first that $f'$ is positive to the left of $p$ and negative to the right of $p$ and
concluding then that $p$ is a local maximum for $f$. QED

There are functions for which both first and second derivatives are 0
at the same point, and then the second derivative test gives no conclusion about local
maximum/minimum at that point. For example,

\[ y = x^4 \]

is ‘very flat’ at $x = 0$ since both its first derivative, which is $4x^3$, and its
second derivative $12x^2$ are 0 at $x = 0$. The point $x = 0$ is in fact a local
minimum for $x^4$ but we cannot see this simply using the second derivative
test.

As another example of what can go wrong, consider

\[ g(x) = x^4 - 4x^3 + 2. \]

The derivative is

\[ g'(x) = 4x^3 - 12x^2 = 4x^2(x - 3) \]

and the second derivative is

\[ g''(x) = 12x^2 - 24x = 12x(x - 2). \]

Thus

\[ g'(0) = 0 \quad \text{and} \quad g'(3) = 0, \]

and

\[ g''(0) = 0 \quad \text{and} \quad g''(3) = 36 > 0. \]

Thus $x = 3$ gives a local minimum, but we cannot draw any conclusions
about $x = 0$ from the second derivative test.
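The test, including its inconclusive case, is easy to express in code. The following is a small sketch under our own naming (`second_derivative_test` is a hypothetical helper, and the tolerance `tol` is an arbitrary choice); the derivatives of $g$ are entered by hand from the formulas above.

```python
def second_derivative_test(d1, d2, p, tol=1e-12):
    """Apply the second derivative test at p, given f' (d1) and f'' (d2)."""
    if abs(d1(p)) > tol:
        return "not a critical point"   # the test needs f'(p) = 0
    second = d2(p)
    if second > tol:
        return "local minimum"
    if second < -tol:
        return "local maximum"
    return "inconclusive"               # f''(p) = 0: the test says nothing


def dg(x):
    return 4 * x ** 3 - 12 * x ** 2     # g'(x) for g(x) = x^4 - 4x^3 + 2


def d2g(x):
    return 12 * x ** 2 - 24 * x         # g''(x)


print(second_derivative_test(dg, d2g, 3))
print(second_derivative_test(dg, d2g, 0))
```

The inconclusive answer at $x = 0$ does not mean there is no extremum in general: for $x^4$ the same answer is returned at 0 even though 0 is a local minimum there.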


Chapter 26 


Exp and Log 


The exponential function is one of the most useful functions in mathematics,
and is expressed through the amazing formula

\[ e^x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots \tag{26.1} \]

Its inverse function is the natural logarithm log. In this chapter we study 
the basic properties of these two functions. 

The history of the discovery/understanding of these two functions is an
entertaining example of how mathematical concepts develop through unexpected
twists and turns and near-misses [1, 2, 4]. Our approach is not historical;
we first summarize the essential facts about $e^x$ and $\log(x)$ that explain
how to work with these functions, and then we give a logical development of
the theory. This approach is fast but gives little insight into how or why these
ideas were developed historically.


26.1 Exp summarized 


The function exp is defined on $\mathbb{R}$ by

\[ \exp(x) = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots \]

for all $x \in \mathbb{R}$. It is a fact that this is a real number (finite) for all $x \in \mathbb{R}$.
The number $\exp(1)$ is denoted $e$:

\[ e = 1 + 1 + \frac{1}{2!} + \frac{1}{3!} + \cdots \approx 2.718281\ldots \tag{26.2} \]
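To get a feeling for how quickly this series settles down, one can add up its first few terms. The sketch below is an illustration only; the function name `exp_partial_sum` is our own, and each term is computed from the previous one by multiplying by $x/(n+1)$, which avoids computing factorials separately.

```python
def exp_partial_sum(x, terms):
    """Sum of the first `terms` terms of 1 + x + x^2/2! + x^3/3! + ..."""
    total = 0.0
    term = 1.0                      # the n = 0 term, x^0 / 0!
    for n in range(terms):
        total += term
        term *= x / (n + 1)         # x^(n+1)/(n+1)! from x^n/n!
    return total


for terms in (1, 2, 5, 10, 20):
    print(terms, exp_partial_sum(1.0, terms))
```

With $x = 1$ the printed values approach the number $e$ of (26.2) after only a handful of terms.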


The importance of e lies in the amazing identity 


\[ \exp(x) = e^x, \]


which is another way of writing (26.1). What exactly it means to raise e to 
the power x, which is a real number, will be explored more carefully later. 
The value of exp at 0 is clearly equal to 1: 


exp(0) = 1. 


Moreover, the derivative of exp is again exp:

\[ \exp' = \exp, \tag{26.3} \]

which can also be expressed as

\[ \frac{d e^x}{dx} = e^x. \tag{26.4} \]

This property along with the value at 0 uniquely characterizes the function
exp: any function on $\mathbb{R}$ whose derivative is itself and whose value at 0 is 1 is
the exponential function.

Figure 26.1 shows the graph of $y = e^x$.

The graph of the exponential function shows rapid increase as $x \to \infty$
and rapid decay to 0 as $x \to -\infty$:

\[ \begin{aligned} \lim_{x \to \infty} e^x &= \infty, \\ \lim_{x \to -\infty} e^x &= 0. \end{aligned} \tag{26.5} \]

Among the early studies of the exponential function is its use in describing
the growth of money under compound interest. This approach leads to the
following limit formulas:

\[ e^x = \lim_{n \to \infty} \left(1 + \frac{x}{n}\right)^n = \lim_{n \to \infty} \left(1 + \frac{1}{n}\right)^{nx}. \tag{26.6} \]

Figure 26.1: Graph of the exponential function.
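The compound interest limit can be watched converging. In the sketch below (the function name `compound` is our own choice), the quantity $(1 + x/n)^n$ is the growth factor for an interest rate $x$ compounded $n$ times; the loop prints it for increasing $n$ together with its distance from `math.exp(x)`.

```python
import math


def compound(x, n):
    """(1 + x/n)**n: growth factor for rate x compounded n times."""
    return (1 + x / n) ** n


for n in (1, 10, 100, 10_000, 1_000_000):
    value = compound(1.0, n)
    print(n, value, math.exp(1.0) - value)
```

The convergence is slow compared with the power series: the gap shrinks roughly in proportion to $1/n$.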


26.2 Log summarized 


The natural logarithm is the inverse function of exp: thus $\log(A)$ is the real
number with the property that

\[ \exp(\log(A)) = A, \]

or, equivalently,

\[ e^{\log(A)} = A. \tag{26.7} \]

It is defined for all $A > 0$.

For example, since $e^0 = 1$ we have

\[ \log 1 = 0. \]

Since $e^1 = e$ we have

\[ \log e = 1. \]

The notation ln is also used to denote the logarithm.
An alternative way to state the fact that log is inverse to the function
exp is

\[ \log e^x = x \quad \text{for all } x \in \mathbb{R}. \tag{26.8} \]

The basic algebraic properties of log are:

\[ \begin{aligned} \log(AB) &= \log(A) + \log(B) \\ \log(A/B) &= \log(A) - \log(B) \\ \log(A^k) &= k \log(A), \end{aligned} \tag{26.9} \]

for all $A, B > 0$ and $k \in \mathbb{R}$. These properties made log enormously useful in
carrying out complex calculations involving multiplication and division. In
fact log (up to a scaling) was studied and used well before the exponential
function was identified.

Figure 26.2: log(A) read off from the graph of $y = e^x$. (The relation $e^B = A$ is equivalent to $B = \log A$.)

The graph of log is the graph of exp viewed from one side (thus, with $x$
and $y$ axes interchanged). We have the following limits of interest:

\[ \begin{aligned} \lim_{x \to \infty} \log(x) &= \infty, \\ \lim_{x \to 0} \log(x) &= -\infty. \end{aligned} \tag{26.10} \]

The derivative of log is the reciprocal:

\[ \frac{d \log(x)}{dx} = \frac{1}{x} \tag{26.11} \]

for $x > 0$.
Figure 26.3 shows the graph of log.

Figure 26.3: The graph of log.

Here are a couple of useful values of log:

\[ \log 1 = 0, \qquad \log(e) = 1. \tag{26.12} \]

The function log is strictly increasing, and so

\[ \begin{aligned} \log(x) &< 0 \quad \text{if } 0 < x < 1; \\ \log(x) &> 0 \quad \text{if } x > 1. \end{aligned} \tag{26.13} \]


26.3. Real Powers 


For any real number $y$ we have the positive integer powers

\[ y^1 = y, \quad y^2 = y \cdot y, \quad y^3 = y \cdot y \cdot y, \quad \ldots \]

The 0-th power is

\[ y^0 = 1. \]

Negative powers are given by reciprocals

\[ y^{-n} = \frac{1}{y^n} \]

for any positive integer $n$. Of course, this makes sense only for $y \ne 0$.
Next we have rational powers. For example,

\[ y^{1/2} \]

is defined to be the unique real number $\ge 0$ whose square is $y$:

\[ (y^{1/2})^2 = y. \]

Thus $y^{1/2}$ is defined only for $y \ge 0$ (for otherwise we couldn’t square something
to end up with $y$).
More generally, for any positive real number $A > 0$ and integers $p$ and $q$,
with $q \ne 0$, the power

\[ A^{p/q} \]


is defined to be the unique positive real number whose $q$-th power is $A^p$:

\[ (A^{p/q})^q = A^p. \]

Thus, we have a definition for

\[ A^r \]

for all positive real $A$ and all rational $r$. These definitions are designed to
ensure that the following convenient algebra holds:

\[ \begin{aligned} A^{r+s} &= A^r A^s \\ (A^r)^s &= A^{rs} \\ (AB)^r &= A^r B^r \end{aligned} \tag{26.14} \]

for all positive real $A$, $B$ and rationals $r$ and $s$.
Moving on to real powers, it is natural to define

\[ A^x = \lim_{r \to x,\ r \in \mathbb{Q}} A^r. \tag{26.15} \]

That this exists for all positive real $A$ and $x \in \mathbb{R}$ is intuitively clear but not
simple to prove. Perhaps the shortest way to see that $A^x$ exists is by using
log: for rational $r$ we have

\[ A^r = (e^{\log A})^r = e^{(\log A) r} = e^{r \log(A)} = \exp(r \log(A)). \]

Since the function exp is continuous (it is, in fact, differentiable), we can take
the limit $r \to x$, for any real number $x$, to obtain:

\[ A^x = \exp(x \log(A)). \tag{26.16} \]

Taking $A = e$ confirms the expected result

\[ e^x = \exp(x) \quad \text{for all } x \in \mathbb{R}. \]

We can also verify the algebraic relations:

\[ \begin{aligned} A^{x+y} &= A^x A^y \\ (A^x)^y &= A^{xy} \\ (AB)^x &= A^x B^x, \end{aligned} \tag{26.17} \]

for all $A, B, x, y \in \mathbb{R}$ with $A, B > 0$.
Using the derivative of the exponential function we obtain immediately
that

\[ \frac{dA^x}{dx} = e^{x \log A} \cdot \log(A). \tag{26.18} \]

Note that on the right we have $e^{x \log A}$ which is, in fact, $A^x$. Thus,

\[ \frac{dA^x}{dx} = A^x \log(A). \]
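Formula (26.16) is also a practical recipe for computing real powers, and the derivative formula can be checked against a difference quotient. The sketch below uses Python's standard `math.exp` and `math.log`; the helper names `real_power` and `power_derivative_numeric`, and the step size `h`, are our own illustrative choices.

```python
import math


def real_power(A, x):
    """A**x for A > 0, computed as exp(x * log(A)) following (26.16)."""
    if A <= 0:
        raise ValueError("A must be positive")
    return math.exp(x * math.log(A))


def power_derivative_numeric(A, x, h=1e-6):
    """Central difference quotient approximating d(A^x)/dx."""
    return (real_power(A, x + h) - real_power(A, x - h)) / (2 * h)


A, x = 2.0, 1.5
print(real_power(A, x), A ** x)                       # the two should agree closely
print(power_derivative_numeric(A, x),                 # numerical slope
      real_power(A, x) * math.log(A))                 # A^x log(A)
```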
26.4 Example Calculations

To see how to work with derivatives of log and exp we work out a few examples.
We have already observed in (26.18) that

\[ \frac{dA^x}{dx} = A^x \log(A), \tag{26.19} \]

for any positive real constant $A$.
Thus, for example,

\[ \begin{aligned} \frac{d\, 2^{(2^x)}}{dx} &= 2^{(2^x)} \log(2) \cdot \frac{d\, 2^x}{dx} \qquad \text{(by the chain rule)} \\ &= 2^{(2^x)} \log(2) \cdot 2^x \log(2) \\ &= 2^{(2^x)}\, 2^x\, (\log 2)^2. \end{aligned} \]

Next consider the derivative of the function $x^x$. To work this out we
rewrite $x$ as $e^{\log(x)}$:

\[ x^x = (e^{\log(x)})^x = e^{x \log(x)}. \]

Now we can differentiate this:

\[ \begin{aligned} \frac{dx^x}{dx} = \frac{d e^{x \log(x)}}{dx} &= e^{x \log(x)} \cdot \frac{d\,(x \log(x))}{dx} \qquad \text{(by the chain rule)} \\ &= e^{x \log(x)} \left[ 1 \cdot \log(x) + x \cdot \frac{1}{x} \right] \\ &= x^x\, [\log x + 1] \qquad \text{(recognizing } e^{x \log(x)} \text{ as } x^x\text{)}. \end{aligned} \]

Note that

\[ 1 + \log(x) = \log e + \log(x) = \log(ex). \]

Similarly,

\[ \frac{d x^{1/x}}{dx} = x^{1/x} \cdot \frac{1 - \log x}{x^2}. \tag{26.20} \]

Observe that this is $> 0$ when $x < e$ and is $< 0$ when $x > e$. This means that
$x^{1/x}$ is increasing for $x < e$ and decreasing for $x > e$. So $x^{1/x}$ is maximum at
$e$:

\[ \max_{x > 0} x^{1/x} = e^{1/e}. \]
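The claim that $x^{1/x}$ peaks at $x = e$ can be checked on a grid of sample points. This is only a numerical sanity check under our own assumptions: the names `h` and `dh`, the grid on $[0.5, 10)$, and its spacing of 0.001 are arbitrary choices, and the derivative formula coded in `dh` is $x^{1/x}(1 - \log x)/x^2$.

```python
import math


def h(x):
    """x**(1/x) for x > 0."""
    return x ** (1.0 / x)


def dh(x):
    """Derivative of x**(1/x), namely x**(1/x) * (1 - log(x)) / x**2."""
    return h(x) * (1 - math.log(x)) / x ** 2


xs = [0.5 + 0.001 * k for k in range(9500)]   # grid on [0.5, 10)
best = max(xs, key=h)                          # grid point with the largest value
print(best, h(best), math.e, math.e ** (1 / math.e))
print(dh(2.0), dh(3.0))                        # positive before e, negative after
```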
26.5 Proofs for Exp and Log

To really develop the theory and results for exp and log we should start with
the definition of $\exp(x)$ as

\[ \exp(x) = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots \]

However, we have not yet developed enough working tools for such power
series, and so we will follow a modest strategy here. We assume that there
exists a function exp defined on $\mathbb{R}$ that is differentiable and satisfies the
following conditions:

\[ \begin{aligned} \exp' &= \exp \\ \exp(0) &= 1. \end{aligned} \tag{26.21} \]

We will prove that there can be at most one such function and then prove
the crucial relation

\[ \exp(x) = e^x, \quad \text{where } e = \exp(1). \]

Observe by the chain rule that

\[ \frac{d \exp(Kx)}{dx} = \exp(Kx) \cdot K = K \exp(Kx), \]

for any $K \in \mathbb{R}$. In particular,

\[ \frac{d \exp(-x)}{dx} = -\exp(-x). \]
Proposition 26.5.1 For any $x \in \mathbb{R}$ the value $\exp(x)$ is not 0, and, moreover,

\[ \exp(x) \exp(-x) = 1. \tag{26.22} \]

Note that this says

\[ \exp(-x) = \frac{1}{\exp(x)}. \tag{26.23} \]

Proof. First we take the derivative of $\exp(x)\exp(-x)$ and show that it is 0:

\[ \begin{aligned} \frac{d}{dx}\left[\exp(x)\exp(-x)\right] &= \frac{d \exp(x)}{dx}\, \exp(-x) + \exp(x)\, \frac{d \exp(-x)}{dx} \\ &= \exp(x)\exp(-x) - \exp(x)\exp(-x) \\ &= 0. \end{aligned} \]

Thus $\exp(x)\exp(-x)$ is constant (Proposition 23.3.1), and so

\[ \exp(x)\exp(-x) = \exp(0)\exp(-0) = 1 \cdot 1 = 1, \]

for all $x \in \mathbb{R}$. This proves (26.22), and from this relation it is clear that
$\exp(x) \ne 0$. QED
The strategy used in the proof above, showing that the derivative of a
function is 0 and then concluding that its value is constant, equal to the 
value at x = 0, will be used several times. 
The following result shows that the function exp is uniquely specified by 
the condition that exp’ = exp and the ‘initial’ value exp(0) = 1: 


Proposition 26.5.2 Suppose $F$ is a differentiable function on $\mathbb{R}$ whose derivative
is equal to itself:

\[ F' = F. \]

Then $F$ is a constant multiple of exp:

\[ F(x) = F(0)\exp(x) \quad \text{for all } x \in \mathbb{R}. \]

Proof. Consider the function $F(x)/\exp(x)$, which is defined for all $x \in \mathbb{R}$
since the denominator $\exp(x)$ is never 0 (Proposition 26.5.1). Taking the
derivative we have

\[ \begin{aligned} \frac{d}{dx}\left(\frac{F(x)}{\exp(x)}\right) &= \frac{\exp(x) F'(x) - F(x) \exp'(x)}{(\exp(x))^2} \qquad \text{(by the quotient rule)} \\ &= \frac{\exp(x) F(x) - F(x) \exp(x)}{(\exp(x))^2} \\ &= 0. \end{aligned} \]

Thus, $F(x)/\exp(x)$ is constant, and so

\[ \frac{F(x)}{\exp(x)} = \frac{F(0)}{\exp(0)} = \frac{F(0)}{1} \]

for all $x \in \mathbb{R}$. Hence

\[ F(x) = F(0)\exp(x) \]

for all $x \in \mathbb{R}$. QED
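One way to see this uniqueness at work is to build an approximate solution of $F' = F$ step by step from its initial value and compare it with $F(0)\exp(x)$. The sketch below uses Euler's method, a standard numerical technique that is not part of these notes; the function name `euler_solve` and the number of steps are illustrative assumptions.

```python
import math


def euler_solve(c, x, steps):
    """Approximate F(x), where F' = F and F(0) = c, by Euler's method."""
    dt = x / steps
    value = c
    for _ in range(steps):
        # F(t + dt) is approximated by F(t) + dt * F'(t) = F(t) + dt * F(t)
        value += dt * value
    return value


c, x = 3.0, 1.0
for steps in (10, 1000, 100_000):
    print(steps, euler_solve(c, x, steps), c * math.exp(x))
```

Each Euler step multiplies the current value by $(1 + x/n)$, so after $n$ steps the approximation is $c\,(1 + x/n)^n$, which connects this computation with the compound interest limit formula for $e^x$.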
Next we can prove a key algebraic property for exp:

Proposition 26.5.3 For every $a, b \in \mathbb{R}$ we have

\[ \exp(a + b) = \exp(a)\exp(b). \tag{26.24} \]

Proof. Consider any $a \in \mathbb{R}$ and let $F$ be the function

\[ F(x) = \exp(a + x) \quad \text{for all } x \in \mathbb{R}. \]

Then, by the chain rule,

\[ F'(x) = \exp'(a + x) \cdot 1 = \exp(a + x) = F(x). \]

Then by Proposition 26.5.2 we conclude that

\[ F(x) = F(0)\exp(x) \quad \text{for all } x \in \mathbb{R}. \]

Observing that

\[ F(0) = \exp(a), \]

we conclude that

\[ F(x) = \exp(a)\exp(x) \quad \text{for all } x \in \mathbb{R}. \]

Recalling that $F(x)$ is $\exp(a + x)$ we are done. QED
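
As a numerical illustration (not part of the notes' argument), one can approximate the solution of $F' = F$, $F(0) = 1$ by Euler's method and observe that the approximation behaves multiplicatively, as (26.24) predicts. The helper name `exp_euler` and the step count are assumptions made for this sketch; Euler's method is a stand-in here, since the notes establish existence of exp by other means.

```python
import math

def exp_euler(x, steps=1_000_000):
    """Euler's method for F' = F, F(0) = 1 on [0, x].

    Each step multiplies the current value by (1 + h) with h = x / steps,
    so after all steps the value is (1 + x/steps) ** steps.
    """
    h = x / steps
    return (1.0 + h) ** steps

a, b = 0.4, 1.1
print("Euler  exp(a+b)      :", exp_euler(a + b))
print("Euler  exp(a)*exp(b) :", exp_euler(a) * exp_euler(b))
print("math.exp(a+b)        :", math.exp(a + b))
```

The three printed values should agree to several decimal places; the small discrepancies come from the finite step size, not from any failure of (26.24).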
It is useful to observe at this stage that $\exp(x)$ is strictly positive:

Proposition 26.5.4 The function exp assumes only positive values:

\[ \exp(x) > 0 \quad \text{for all } x \in \mathbb{R}. \tag{26.25} \]

Proof. This follows from writing $x$ as $x/2 + x/2$ and using the previous
Proposition:

\[ \exp(x) = \exp\left(\frac{x}{2} + \frac{x}{2}\right) = \exp(x/2)\exp(x/2) = [\exp(x/2)]^2. \]

This, being a square, is $\ge 0$. Moreover, we know from Proposition 26.5.1
that $\exp(x/2)$ is not 0. Hence $\exp(x)$ is actually $> 0$. QED

From the exponential multiplicative property in Proposition 26.5.3 we
have

\[ \exp(2a) = \exp(a + a) = [\exp(a)]^2 \]

and

\[ \exp(3a) = \exp(2a + a) = \exp(2a)\exp(a) = [\exp(a)]^2 \exp(a) = [\exp(a)]^3. \]

This line of reasoning implies that

\[ \exp(Na) = [\exp(a)]^N \tag{26.26} \]

for all $a \in \mathbb{R}$ and $N \in \{1, 2, 3, \ldots\}$. Next, for such $a$ and $N$ we have

\[ \exp(-Na) = \frac{1}{\exp(Na)} \quad \text{(by (26.23))}
   = \frac{1}{[\exp(a)]^N} = [\exp(a)]^{-N}. \]

Thus we have

\[ \exp(na) = [\exp(a)]^n \tag{26.27} \]

for $a \in \mathbb{R}$ and all integers $n \in \mathbb{Z}$ (you can check the case $n = 0$ directly: both
sides are 1 in that case).

To proceed to rational powers first consider a simple case $\exp\bigl(\tfrac{1}{2}a\bigr)$; this
is a positive real number whose square is

\[ \Bigl[\exp\bigl(\tfrac{1}{2}a\bigr)\Bigr]^2 = \exp\bigl(2 \cdot \tfrac{1}{2}a\bigr) = \exp(a). \]

Hence $\exp\bigl(\tfrac{1}{2}a\bigr)$ is the positive square-root of $\exp(a)$:

\[ \exp\bigl(\tfrac{1}{2}a\bigr) = [\exp(a)]^{1/2}. \]

Proceeding on to a general rational number

\[ r = \frac{p}{q} \quad \text{where } p, q \in \mathbb{Z} \text{ and } q \neq 0, \]

we have

\[ [\exp(ra)]^q = \exp(qra) = \exp(pa) = [\exp(a)]^p, \]

and so $\exp(ra)$ is a positive real number whose $q$-th power is $[\exp(a)]^p$. Then,
by definition of $(\cdot)^{p/q}$, we have

\[ \exp(ra) = [\exp(a)]^{p/q}. \]

Thus,

\[ \exp(ra) = [\exp(a)]^r \tag{26.28} \]

for all $a \in \mathbb{R}$ and all $r \in \mathbb{Q}$.
Finally, we can proceed to an arbitrary real power. Consider any $a, x \in \mathbb{R}$.
The definition of the $x$-th power is that

\[ [\exp(a)]^x = \lim_{r \to x,\ r \in \mathbb{Q}} [\exp(a)]^r. \]

But we know that for rational $r$ we have

\[ [\exp(a)]^r = \exp(ra), \]

and so, since the differentiable function exp is continuous, we have

\[ \lim_{r \to x,\ r \in \mathbb{Q}} [\exp(a)]^r = \exp(xa). \]

Putting everything together we have:

Proposition 26.5.5 For any real numbers $a$ and $x$ we have

\[ \exp(ax) = [\exp(a)]^x. \tag{26.29} \]

Now we specialize this to $a = 1$ to obtain the crucial formula:

\[ \exp(x) = e^x \quad \text{for all } x \in \mathbb{R}, \tag{26.30} \]

where $e$ is the value of exp at 1:

\[ e = \exp(1). \tag{26.31} \]


The derivative of exp is exp, by assumption in our approach, and this can
now be displayed as

\[ \frac{d e^x}{dx} = e^x. \]


Proposition 26.5.6 The exponential function is strictly increasing:

\[ e^a < e^b \quad \text{for all } a, b \in \mathbb{R} \text{ with } a < b. \tag{26.32} \]

Figure 26.4: The inequality $e^x \ge 1 + x$ in terms of the tangent at (0, 1).

Proof. The derivative of exp is exp, and this is always positive. Hence exp is
strictly increasing. QED


Taking one of the numbers $a$ and $b$ to be 0 and using $e^0 = 1$ we conclude
that

\[ e^x > 1 \ \text{ if } x > 0; \qquad e^x < 1 \ \text{ if } x < 0. \tag{26.33} \]

Proposition 26.5.7 The function exp satisfies

\[ \exp(x) \ge 1 + x \quad \text{for all } x \in \mathbb{R}, \tag{26.34} \]

with $=$ holding only when $x = 0$.

A geometric way of understanding this inequality is that the graph of
$y = e^x$ lies above the tangent line $y = x + 1$ at (0, 1) (see Figure 26.4).

Proof. First observe that

\[ \frac{d\bigl(e^x - (1 + x)\bigr)}{dx} = e^x - 1. \]

This is positive when $x > 0$ and negative when $x < 0$. Thus, $e^x - (1 + x)$
attains its minimum value when $x = 0$:

\[ e^x - (1 + x) > e^0 - (1 + 0) = 1 - 1 = 0 \quad \text{for all } x > 0 \text{ and all } x < 0. \]

This proves the inequality (26.34). QED
One consequence of the inequality (26.34) is that $\exp(x)$ goes to $\infty$ when
$x \to \infty$:

\[ \lim_{x \to \infty} e^x = \infty. \tag{26.35} \]

Working with $e^{-x} = 1/e^x$ we then see that

\[ \lim_{x \to -\infty} e^x = 0. \tag{26.36} \]


The intermediate value theorem implies that the function exp has an
inverse defined on all positive real numbers; this function is log:

\[ \log = \exp^{-1} : (0, \infty) \to \mathbb{R}, \tag{26.37} \]

satisfying

\[ \exp(\log(A)) = A \ \text{ for all } A \in (0, \infty); \qquad
   \log(\exp(B)) = B \ \text{ for all } B \in \mathbb{R}. \tag{26.38} \]


If a function $f$, defined on an open interval $U$, has an inverse $f^{-1}$, and if
$f$ is differentiable at $p \in U$ with $f'(p) \neq 0$ then the derivative of the inverse
function is given by

\[ (f^{-1})'(q) = \frac{1}{f'(p)}, \tag{26.39} \]

where $q = f(p)$, which means $p = f^{-1}(q)$. Applying this to the exponential
function and its inverse, log, we have first

\[ \log'(q) = \frac{1}{\exp'(p)} = \frac{1}{e^p} = \frac{1}{q} \]

for all $p \in \mathbb{R}$ and $q = e^p$. Restating in different notation this says:

\[ \frac{d \log x}{dx} = \frac{1}{x} \quad \text{for } x \in (0, \infty). \tag{26.40} \]
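
The inverse-function formula can be sanity-checked numerically. The sketch below is an illustration added here, with hypothetical helper names `inverse_derivative` and `central_difference`; it compares $1/\exp'(p)$ at $p = \log q$ with a symmetric difference quotient of log at $q$, and with $1/q$.

```python
import math

def inverse_derivative(f_prime, p):
    """Derivative of the inverse of f at q = f(p), using 1 / f'(p)."""
    return 1.0 / f_prime(p)

def central_difference(g, x, h=1e-6):
    """Symmetric difference quotient approximating g'(x)."""
    return (g(x + h) - g(x - h)) / (2.0 * h)

for q in [0.5, 1.0, 3.0, 10.0]:
    p = math.log(q)                                  # p is the point with exp(p) = q
    by_formula = inverse_derivative(math.exp, p)     # exp' = exp
    by_quotient = central_difference(math.log, q)
    print(q, by_formula, by_quotient, 1.0 / q)
```

Each row should show three nearly equal numbers, in line with (26.40).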


Chapter 27 


Convexity 


Convexity is a powerful notion. It is useful in establishing results about 
maxima and minima, and useful in many different applications. This chapter 
gives an initial glimpse of this deep and varied subject. 


27.1 Convex and concave functions 


A function $f$ is said to be convex on an interval $U$ if for any points $a, b \in U$,
the graph $y = f(x)$, over the interval $x \in [a, b]$, lies below the secant line segment
joining the point $(a, f(a))$ with $(b, f(b))$.


Figure 27.1: Convex function: the graph lies below secant segments


By ‘below’ we allow the possibility that the graph and the secant segment
might touch or run along each other. For example, a function whose graph
is a straight line is convex.

A function $f$ is strictly convex on an interval $U$ if the graph of $f$ between
any two points in $U$ lies strictly below the corresponding secant segment.

A function $f$ is concave over an interval $U$ if the graph of $f$ between any
$a, b \in U$ lies above the secant segment joining $A(a, f(a))$ and $B(b, f(b))$. It
is strictly concave if the graph lies strictly above all secant segments.

Thus, a function whose graph is a straight line is both convex and concave. 


Figure 27.2: Concave function: the graph lies above secant segments


Note that we only consider the notions of convexity and concavity of 
functions defined on intervals. 


27.2 Convexity and slope 


Consider a function $f$ on an interval $U$. Let us take three points $a, p, b \in U$
with

\[ a < p < b \]

and examine the behavior of the secants over $[a, p]$ and over $[p, b]$.

Let $A$ be the point on the graph of $y = f(x)$ with $x$-coordinate $a$; thus $A$
is $(a, f(a))$. Next let $B$ be $(b, f(b))$ and $P$ the point $(p, f(p))$. Let $Q$ be the
point on the secant segment $AB$ whose $x$-coordinate is $p$. Thus $Q$ is of the
form

\[ (p, L(p)) \]

where $L(p)$ is the $y$-coordinate of $Q$.

The condition that the point $P(p, f(p))$ lie below the segment $AB$ means
that $P$ is below $Q$:

\[ f(p) \le L(p). \]

This means that the slope of $AP$ is $\le$ the slope of $AQ$. Now the slope of $AQ$
is the same as the slope of $AB$. Thus the condition that $P$ is below $AB$ is
equivalent to

\[ \text{slope } AP \le \text{slope } AB. \]

Similarly, the same condition is also equivalent to

\[ \text{slope } AB \le \text{slope } PB. \]

Thus, convexity of $f$ is equivalent to the relations

\[ \text{slope } AP \le \text{slope } AB \le \text{slope } PB, \]

for all $a, p, b \in U$ with $a < p < b$.


Figure 27.3: Convex function: secant slopes increase


Just the condition

\[ \text{slope } AP \le \text{slope } PB \tag{27.1} \]

forces the slope of $AB$ to lie in between these values. For example, since
the slope of $AP$ is $\le$ the slope of $PB$, the segment $AP$ produced beyond $P$
would end up below $B$, and this would mean that the slope of $AP$ is $\le$ the
slope of $AB$.


Similarly, $f$ is concave over an interval if the secant slopes decrease.


27.3 Checking convexity/concavity


In this section we will see how to check convexity/concavity of specific
functions and how to work with such functions. The methods used will be justified
in later sections.

A function $f$ is convex on an interval $U$ if its derivative $f'$ is an increasing
function on $U$ (this means $f'(s) \le f'(t)$ if $s < t$); it is strictly convex if $f'$ is
strictly increasing ($f'(s) < f'(t)$ if $s < t$). Mostly we can check for convexity
even more efficiently: if the second derivative $f''$ is $\ge 0$ on $U$ then $f$ is convex
on $U$, and if $f'' > 0$ on $U$ then $f$ is strictly convex on $U$.

For concavity we have the analogous results. A function $f$ is concave on
an interval $U$ if its derivative $f'$ is a decreasing function on $U$; it is strictly
concave if $f'$ is strictly decreasing ($f'(s) > f'(t)$ if $s < t$). If $f''$ is $\le 0$ on $U$
then $f$ is concave on $U$, and if $f'' < 0$ on $U$ then $f$ is strictly concave on $U$.

Consider the function

\[ x^2. \]

Its second derivative is 2:

\[ (x^2)' = 2x \quad \text{and} \quad (x^2)'' = 2 > 0, \]

and so $x^2$ is strictly convex on $\mathbb{R}$.

Next consider

\[ x^4. \]

Its second derivative is $12x^2$, which is $\ge 0$ everywhere and so the function
is convex. (In fact it is strictly convex even though $12x^2$ hits 0 at the single
point $x = 0$.)

More generally, consider

\[ x^p, \]

for $x$ in the interval $(0, \infty)$, where $p$ is some constant. The derivative is $px^{p-1}$
and the second derivative is

\[ p(p - 1)x^{p-2}. \]

If the power $p$ is $\ge 1$ then this second derivative is $\ge 0$ for all $x \in (0, \infty)$ and so
$x^p$ is convex on $(0, \infty)$ if $p \ge 1$. In fact, $x^p$ is strictly convex on $(0, \infty)$ if
$p > 1$.

Another example is

\[ \frac{1}{x}. \]

Its derivative is

\[ -\frac{1}{x^2}, \]

and the second derivative is

\[ \frac{2}{x^3}, \]

which is $> 0$ when $x > 0$ and is $< 0$ when $x < 0$. Thus
$1/x$ is strictly convex for $x \in (0, \infty)$,
and is strictly concave for $x < 0$.
Now let us look at

\[ e^x. \]

The second derivative is $e^x$, which is positive. Hence, $e^x$ is strictly convex on
$\mathbb{R}$.

Lastly consider

\[ \log x \]

for $x > 0$. The derivative is

\[ \frac{1}{x}, \]

which is strictly decreasing. Hence $\log x$ is strictly concave.
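
The conclusions reached above by derivative tests can be cross-checked against the secant definition of Section 27.1. The following sketch is an added illustration with hypothetical helper names `looks_convex` and `looks_concave`; it tests only the midpoint of each sampled secant, so a pass is evidence on the sample points and not a proof.

```python
import itertools
import math

def looks_convex(f, points, tol=1e-12):
    """True if f(midpoint) <= average of endpoint values for every sampled pair."""
    return all(
        f((a + b) / 2.0) <= (f(a) + f(b)) / 2.0 + tol
        for a, b in itertools.combinations(points, 2)
    )

def looks_concave(f, points, tol=1e-12):
    """Concavity of f is convexity of -f."""
    return looks_convex(lambda x: -f(x), points, tol)

positive = [0.1 * k for k in range(1, 60)]      # sample of (0, infinity)
real = [-3.0 + 0.1 * k for k in range(61)]      # sample of [-3, 3]

print("x^2 convex on sample:  ", looks_convex(lambda x: x ** 2, real))
print("x^4 convex on sample:  ", looks_convex(lambda x: x ** 4, real))
print("1/x convex on x > 0:   ", looks_convex(lambda x: 1.0 / x, positive))
print("e^x convex on sample:  ", looks_convex(math.exp, real))
print("log concave on x > 0:  ", looks_concave(math.log, positive))
```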


27.4 Inequalities from convexity/concavity


Recall that a function $\Phi$, on an interval $U$, is convex if its graph lies below
the secant segments, with strict convexity meaning that the graph always
lies strictly below the secant segments. From this one can show that $\Phi$ is
convex if and only if

\[ \Phi(\text{weighted average}) \le \text{weighted average of } \Phi, \tag{27.2} \]


so that, for instance 


210 Ambar N. Sengupta 11/6/2011 


for all $a, b \in U$. Strict convexity means that here $\le$ would be replaced by
$<$ except in the trivial case where $a = b$. For concavity the condition (27.2) is
altered by replacing $\le$ by $\ge$.

The weighted average might involve several points/numbers drawn from 
U. For example, if ® is convex on an interval U then 


for all a,b,c € U. 
A weighted average is also called a convex combination. Thus, 


is a convex combination of $a$, $b$, $c$, and $d$. Thus, a convex combination of
$p_1, \ldots, p_N \in \mathbb{R}$ is of the form

\[ w_1 p_1 + \cdots + w_N p_N, \]

where the weights $w_1, \ldots, w_N$ lie in $[0, 1]$ and add up to 1:

\[ w_1 + \cdots + w_N = 1. \]


Thus, (27.2), written out more formally, says that a function $\Phi$ on an interval
$U$ is convex if and only if

\[ \Phi(w_1 p_1 + \cdots + w_N p_N) \le w_1 \Phi(p_1) + \cdots + w_N \Phi(p_N) \tag{27.3} \]

for all $p_1, \ldots, p_N \in U$ and all weights $w_1, \ldots, w_N \in [0, 1]$ (adding to 1), for
every choice of $N \in \{1, 2, 3, \ldots\}$. The function $\Phi$ is strictly convex if (27.3)
holds with $\le$ replaced by $<$ if the points $p_1, \ldots, p_N$ are all distinct and none
of the weights is 1. For concavity we simply reverse the inequalities.

Let us apply the characterization of convexity given in the inequality 
(27.3) to the functions we looked at in the preceding section. 

For the convex function $x^2$ we have, using the simplest interesting weighted
average:

\[ \left(\frac{1}{2}a + \frac{1}{2}b\right)^2 \le \frac{1}{2}a^2 + \frac{1}{2}b^2. \tag{27.4} \]

Thus, the square of an average is at most the average of the squares. Note
that since $x^2$ is actually strictly convex the inequality $\le$ can be replaced by
$<$, unless $a = b$. With three points we have

\[ \left(\frac{1}{3}a + \frac{1}{3}b + \frac{1}{3}c\right)^2 \le \frac{1}{3}a^2 + \frac{1}{3}b^2 + \frac{1}{3}c^2. \tag{27.5} \]

More generally, for any $x_1, \ldots, x_N \in \mathbb{R}$ we have

\[ \left(\frac{x_1 + \cdots + x_N}{N}\right)^2 \le \frac{1}{N}\left(x_1^2 + \cdots + x_N^2\right). \tag{27.6} \]

Here the $\le$ can be replaced by $<$ except in the case all the $x_j$'s are equal.
We can also write the inequality (27.6) as:

\[ x_1^2 + \cdots + x_N^2 \ge \frac{(x_1 + \cdots + x_N)^2}{N}, \tag{27.7} \]

with equality holding if and only if all the $x_j$ are equal.

Here is a quick application: suppose a length $L$ of wire is cut into $N$
pieces, of lengths $x_1, \ldots, x_N$, and each piece is bent into a square; what
should the lengths $x_j$ be in order for the squares to cover a minimum total
area? To answer this notice that the sum of the areas of the squares is

\[ \left(\frac{x_1}{4}\right)^2 + \cdots + \left(\frac{x_N}{4}\right)^2 = \frac{x_1^2 + \cdots + x_N^2}{16} \]

and from (27.7) we see that this is

\[ \ge \frac{(x_1 + \cdots + x_N)^2 / N}{16} = \frac{L^2}{16N}, \]

with $\ge$ being $=$ if and only if all the $x_j$ are equal. Thus the minimum area
is obtained if the wire is cut into equal lengths and the minimum area thus
obtained is $L^2/(16N)$.
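
The wire problem lends itself to a quick numerical experiment. In the sketch below (an added illustration; the function names, the values $L = 10$, $N = 4$, and the random cutting scheme are all assumptions chosen for the example) random cuts never produce a total area below $L^2/(16N)$, and the equal cut attains that value.

```python
import random

def total_area(lengths):
    """Total area of the squares bent from the pieces (each side is length / 4)."""
    return sum((x / 4.0) ** 2 for x in lengths)

def random_cut(L, N, rng):
    """Cut a wire of length L into N pieces at N - 1 uniformly random points."""
    cuts = sorted(rng.uniform(0.0, L) for _ in range(N - 1))
    edges = [0.0] + cuts + [L]
    return [edges[i + 1] - edges[i] for i in range(N)]

L, N = 10.0, 4
rng = random.Random(0)                  # fixed seed so runs are repeatable
bound = L ** 2 / (16 * N)               # the claimed minimum area
equal = total_area([L / N] * N)         # area when all pieces are equal
worst_gap = min(total_area(random_cut(L, N, rng)) - bound for _ in range(1000))

print("bound L^2/(16N):          ", bound)
print("area for equal pieces:    ", equal)
print("smallest (area - bound):  ", worst_gap)
```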

Now we turn to another example, the strictly concave function log, defined
on $(0, \infty)$. Working with the equally weighted average of $a, b > 0$ we have

\[ \log\left(\frac{1}{2}a + \frac{1}{2}b\right) \ge \frac{1}{2}\log a + \frac{1}{2}\log b. \tag{27.8} \]

The right side simplifies to

\[ \frac{1}{2}(\log a + \log b) = \frac{1}{2}\log(ab) = \log\sqrt{ab}. \]

So the inequality reads

\[ \log\left(\frac{a + b}{2}\right) \ge \log\sqrt{ab}, \tag{27.9} \]

with equality if and only if $a = b$. Since log is a strictly increasing function,
this inequality becomes

\[ \frac{a + b}{2} \ge \sqrt{ab}, \tag{27.10} \]


with equality if and only if $a = b$. Now let us apply concavity to three values
$a, b, c$; this leads to

\[ \log\left(\frac{a + b + c}{3}\right) \ge \frac{1}{3}(\log a + \log b + \log c) = \log (abc)^{1/3}, \]

which means

\[ \frac{a + b + c}{3} \ge (abc)^{1/3}, \tag{27.11} \]

with equality if and only if $a$, $b$ and $c$ are equal.

The inequalities (27.10) and (27.11) are purely algebraic, having nothing
to do with log, and, at first sight, seem to have no relationship to each
other. (The first inequality can be proved directly just by simple algebra:
$(\sqrt{a} - \sqrt{b})^2 \ge 0$ implies $a + b - 2\sqrt{ab} \ge 0$.) However, both express a common
idea: the arithmetic mean (AM) is greater than or equal to the geometric mean
(GM). The general version of this AM-GM inequality, for positive numbers
$x_1, \ldots, x_N$, is:

\[ \frac{x_1 + \cdots + x_N}{N} \ge (x_1 \cdots x_N)^{1/N}, \tag{27.12} \]

with equality if and only if all the $x_j$ are equal. This follows from the strict
concavity of log.
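
A short numerical check of the AM-GM inequality, added here as an illustration. The geometric mean is computed as the exponential of the average logarithm, mirroring the log-concavity argument; the function names and sample ranges are assumptions for this sketch.

```python
import math
import random

def arithmetic_mean(xs):
    return sum(xs) / len(xs)

def geometric_mean(xs):
    # exp of the average logarithm; requires every entry to be > 0
    return math.exp(sum(math.log(x) for x in xs) / len(xs))

rng = random.Random(0)
for _ in range(5):
    xs = [rng.uniform(0.1, 10.0) for _ in range(4)]
    print(round(arithmetic_mean(xs), 6), ">=", round(geometric_mean(xs), 6))

same = [3.0, 3.0, 3.0]                  # equality case: all entries equal
print(arithmetic_mean(same), geometric_mean(same))
```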

Not all convex functions of interest are differentiable everywhere. Here
are a few convex functions that are not differentiable at all points:

\[ |x|, \qquad x_+ = \max\{x, 0\}, \qquad (x - K)_+ = \max\{x - K, 0\}, \]

for any constant $K \in \mathbb{R}$.

Figure 27.4: The convex function $|x|$

Figure 27.5: The convex function $(x - 1)_+$


27.5 Convexity and derivatives 


We have seen in Section 27.2 that convexity of a function can be understood
in terms of increasing secant slopes. From this we arrive at a condition for
convexity for differentiable functions:

Proposition 27.5.1 Suppose $f$ is a function on an interval $U$ on which $f$
is differentiable.

Then $f$ is convex on $U$ if and only if $f'$ is an increasing function in the
sense that $f'(s) \le f'(t)$ for all $s, t \in U$ with $s < t$.

The function $f$ is concave if and only if $f'$ is a decreasing function in
the sense that $f'(s) \ge f'(t)$ for all $s, t \in U$ with $s < t$.


Proof. Suppose $f$ is convex on $U$, and $s, t \in U$ with $s < t$. Then for any
$x, w \in [s, t]$ with

\[ s < x < w < t \]

we have the increasing slope condition

\[ \frac{f(x) - f(s)}{x - s} \le \frac{f(t) - f(w)}{t - w}. \]

Now taking the limits $x \to s$ and $w \to t$ we have

\[ f'(s) \le f'(t). \]


Conversely, suppose the function $f$ on $U$ has increasing slope $f'$. Consider
$a, p, b \in U$ with

\[ a < p < b. \]

Let $A$ be the point $(a, f(a))$, and $B$ the point $(b, f(b))$ and $P$ the point
$(p, f(p))$. Then

\[ \text{slope of } AP = \frac{f(p) - f(a)}{p - a} = f'(s) \quad \text{for some } s \in (a, p), \]

by the mean value theorem. We also have:

\[ \text{slope of } PB = \frac{f(b) - f(p)}{b - p} = f'(t) \quad \text{for some } t \in (p, b). \]

Since $s < t$ (because $p$ lies between them), we know that

\[ f'(s) \le f'(t). \]

Hence

\[ \text{slope of } AP \le \text{slope of } PB. \]

Since this holds for all points $a, p, b \in U$, with $a < p < b$, the function $f$ is
convex on $U$.

The argument for concavity is exactly similar. QED


Observe in the proof that if $f'$ is assumed to be strictly increasing, that
is

\[ f'(s) < f'(t) \quad \text{whenever } s < t \text{ and } s, t \in U, \]

then $f$ is strictly convex.
Similarly, if f’ is strictly decreasing then f is strictly concave. 
When working with functions that are twice differentiable there is an
easier condition for convexity:

Proposition 27.5.2 Suppose $f$ is a function on an interval $U$ on which $f$
is differentiable, and suppose also that $f'$ is differentiable on $U$.
Then

(i) $f$ is convex if and only if $f''$ is $\ge 0$ on $U$;
(ii) $f$ is concave if and only if $f''$ is $\le 0$ on $U$.

For strict convexity/concavity we have:

(iii) $f$ is strictly convex if $f''$ is $> 0$ on $U$;
(iv) $f$ is strictly concave if $f''$ is $< 0$ on $U$.

Note that in (iii) and (iv) we don’t have the ‘only if’ parts. For example,
$x^4$ is strictly convex but its second derivative is $12x^2$ which is 0 when $x = 0$.

Proof. If $f'' \ge 0$ on $U$ then $f'$ is an increasing function on $U$ (by the earlier
result relating the sign of a derivative to monotonicity) and so by
Proposition 27.5.1 we conclude that $f$ is convex. Conversely,
if $f$ is convex then by Proposition 27.5.1 the derivative $f'$ is an increasing
function on $U$ and so, by Proposition 23.1.1, its derivative $f''$ is $\ge 0$ on $U$.
The results (iii) and (iv) follow similarly. QED


27.6 Supporting Lines 


Consider a convex function $\Phi$ on an open interval $U \subset \mathbb{R}$. We have seen in
(27.1) that for any point $p \in U$ the slopes of secant segments of the graph of
$\Phi$ to the right of $p$ are at least as large as the slopes to the left of $p$. More
formally:

\[ \frac{\Phi(a) - \Phi(p)}{a - p} \le \frac{\Phi(b) - \Phi(p)}{b - p}
   \quad \text{for all } a, b \in U \text{ with } a < p < b. \tag{27.13} \]

If we now squeeze in a value $m \in \mathbb{R}$ between these left secant segment
slopes and right secant segment slopes (we will examine this more carefully
below) then we have

\[ \text{slope of any secant segment to the left of } p \ \le\ m \ \le\ \text{slope of any secant segment to the right of } p. \]

Consider now the line $L$ through $(p, \Phi(p))$ whose slope is $m$. To the right
of $p$ it lies below all the secant segments for the graph of $\Phi$ and so the line $L$
lies below the graph of $\Phi$ to the right of $p$. But to the left of $p$ the slope of $L$
is greater than or equal to the secant slopes and so again the line $L$ lies below
the graph of $\Phi$ to the left of $p$. Thus we have:

Proposition 27.6.1 Suppose $\Phi$ is a convex function on an open interval $U$,
and let $p$ be any point in $U$. Then there is a value $m \in \mathbb{R}$ such that the line
through $(p, \Phi(p))$ with slope $m$ lies below the graph of $\Phi$; more precisely,

\[ L(x) \le \Phi(x) \quad \text{for all } x \in U, \]

where $y = L(x)$ is the equation of the line through $(p, \Phi(p))$ having slope $m$.

Figure 27.6: Supporting line for a convex function

A line such as $L$ is called a supporting line for $\Phi$ at $p$.

If $\Phi'(p)$ exists at $p$ then there is only one choice for the line $L$: it is the
line through $(p, \Phi(p))$ with slope $m = \Phi'(p)$, that is, it is the tangent line
to the graph of $\Phi$ at $(p, \Phi(p))$. Notice that our conclusion that $m$ is a real
number forces $\Phi'(p)$, if it exists, to be finite.

Proof. All we have to do is prove that a real number $m$ exists satisfying

\[ \frac{\Phi(a) - \Phi(p)}{a - p} \le m \le \frac{\Phi(b) - \Phi(p)}{b - p}
   \quad \text{for all } a, b \in U \text{ with } a < p < b. \tag{27.14} \]

Since (27.13) holds for every $a \in U$ to the left of $p$ we see that the ‘right
slope’ $\frac{\Phi(b) - \Phi(p)}{b - p}$ is an upper bound for all the ‘left slopes’, and so

\[ m_- \stackrel{\text{def}}{=} \sup_{a \in U,\ a < p} \frac{\Phi(a) - \Phi(p)}{a - p} \le \frac{\Phi(b) - \Phi(p)}{b - p}. \]

Thus,

\[ m_- \le \frac{\Phi(b) - \Phi(p)}{b - p} \quad \text{for all } b \in U \text{ with } p < b. \]

Thus $m_-$ is a lower bound for all the right slopes, and so

\[ m_- \le m_+, \tag{27.15} \]

where

\[ m_+ = \inf_{b \in U,\ b > p} \frac{\Phi(b) - \Phi(p)}{b - p}. \]

Both the left and right derivatives $m_\pm$ are finite as they are $\ge$ all the secant
segment slopes to the left of $p$ and are $\le$ all the secant segment slopes to the
right of $p$. Now we can simply take $m$ to lie between these values:

\[ m_- \le m \le m_+. \]

QED
The secant slope inequalities for a convex function have the following 
remarkable consequence: 


Proposition 27.6.2 Any convex function on an open interval U is contin- 
uous on U. 


Proposition 27.6.1 leads to a way of understanding convex functions by
means of the supporting lines:

Proposition 27.6.3 If $\Phi$ is a convex function on an open interval $U$ then
$\Phi$ is the supremum of all the ‘line-functions’ that lie below it:

\[ \Phi(p) = \sup_{L \le \Phi} L(p) \quad \text{for all } p \in U, \tag{27.16} \]

where $L$ denotes any function of the form $L(x) = Mx + k$, with constant
$M, k \in \mathbb{R}$, for all $x \in U$.

Proof. We have already seen in Proposition 27.6.1 that the graph of $\Phi$ has
a supporting line at every point. Thus for any $p \in U$ there is a function $L$,
whose graph is a line, for which $L(x) \le \Phi(x)$ for all $x \in U$ (this means the
graph of $L$ lies below the graph of $\Phi$) and $L(p) = \Phi(p)$. This proves (27.16),
by showing that in fact there is actually an $L$ for which both $L \le \Phi$ on $U$
and $L(p) = \Phi(p)$. QED
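
To see the supremum of supporting lines at work, the sketch below (an added illustration, with $\Phi(x) = x^2$, a finite grid of tangent points, and the helper names all chosen as assumptions for this example) takes the maximum over finitely many tangent lines. Every tangent lies below the graph, and the maximum recovers $\Phi$ exactly at grid points and approximately in between.

```python
def tangent_line(phi, dphi, t):
    """Return the line-function x -> phi(t) + dphi(t) * (x - t)."""
    return lambda x: phi(t) + dphi(t) * (x - t)

phi = lambda x: x * x          # a differentiable convex function
dphi = lambda x: 2.0 * x       # its derivative

grid = [-2.0 + 0.01 * k for k in range(401)]           # tangent points in [-2, 2]
lines = [tangent_line(phi, dphi, t) for t in grid]     # supporting lines at those points

def sup_of_lines(p):
    """Maximum over the finitely many supporting lines, evaluated at p."""
    return max(line(p) for line in lines)

for p in [-1.5, 0.0, 0.505, 1.25]:
    print(p, phi(p), sup_of_lines(p))
```

With all supporting lines in play (rather than a finite grid) the maximum would equal $\Phi(p)$ at every point, which is the content of (27.16).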
27.7 Convex combinations


If $a, b \in \mathbb{R}$ are points with $a < b$, then any point $p \in [a, b]$ can be reached by
starting at $a$ and moving towards $b$, covering a fraction

\[ \frac{p - a}{b - a} \]

of the interval $[a, b]$. Thus,

\[ p = a + \frac{p - a}{b - a}\,(b - a), \]

a formula you can readily check directly with algebra. Let $\mu$ denote the
fraction

\[ \mu = \frac{p - a}{b - a}. \]

Since $p \in [a, b]$ it is clear that $\mu \ge 0$ and the most it can be is $(b - a)/(b - a) = 1$:

\[ \mu \in [0, 1]. \]

We can write $p$ as

\[ p = a + \mu(b - a) = a + \mu b - \mu a = a - \mu a + \mu b = (1 - \mu)a + \mu b. \]

Writing $\lambda$ for $1 - \mu$ we thus see that every point $p \in [a, b]$ can be expressed
as a convex combination of $a$ and $b$:

\[ p = \lambda a + \mu b, \tag{27.17} \]

where $\lambda$ and $\mu$ are weights, in the sense that they are non-negative and add
up to 1 (which forces $\lambda$ and $\mu$ to be $\le 1$):

\[ \lambda, \mu \in [0, 1], \qquad \lambda + \mu = 1. \]

Conversely, any convex combination (27.17) is at most $b$ (the weight on $a$,
which is $< b$, would draw the value of $p$ down below $b$) and at least $a$:

\[ a \le \text{any convex combination of } a \text{ and } b \le b. \]

The point half way between $a$ and $b$ is

\[ \frac{1}{2}a + \frac{1}{2}b = \frac{a + b}{2}, \]

whereas the point 2/5-th of the way from $a$ to $b$ is:

\[ \frac{3}{5}a + \frac{2}{5}b = \frac{3a + 2b}{5}. \]

A convex combination of points $p_1, \ldots, p_N$ is a point which can be
expressed as

\[ w_1 p_1 + \cdots + w_N p_N, \]

where the weights $w_1, \ldots, w_N$ all lie in $[0, 1]$ and sum to 1:

\[ w_1 + \cdots + w_N = 1. \]

Note that if $p_1 = \cdots = p_N$, all points coinciding, then their convex
combination is just that one point, since then

\[ w_1 q + \cdots + w_N q = (w_1 + \cdots + w_N)q = 1 \cdot q = q, \]

where $q$ is the common value of the $p_j$.

For a given set of points $p_1, \ldots, p_N$, the largest (the most to the right)
that a convex combination could be is $\max_j p_j$, and for this we would have
to distribute the entire weight 1 on those $j$ for which $p_j$ is largest, and take
all the other weights to be 0; for example, for $a, b, c, d$ with $d = c > b > a$, the
weighted average

\[ w_1 a + w_2 b + w_3 c + w_4 d = w_1 a + w_2 b + (w_3 + w_4)d \]

is largest if $w_3 + w_4 = 1$ and $w_1 = w_2 = 0$.

Similarly, the least value a convex combination of $p_1, \ldots, p_N$ could have
is $\min_j p_j$, and this is obtained if and only if a weight of 0 is given to
non-minimum values of $p_j$.

A multiple convex combination can be built out of convex combinations
of pairs. For example,

\[ \frac{2}{10}a + \frac{3}{10}b + \frac{5}{10}c = \frac{2}{10}a + \frac{8}{10}\left[\frac{3}{8}b + \frac{5}{8}c\right], \]

where notice that the quantity inside $[\cdots]$ is indeed also a convex
combination; we thus have, on the right, a convex combination of convex
combinations. More generally, for $N \in \{2, 3, \ldots\}$, any points $p_1, \ldots, p_N \in \mathbb{R}$, and any
weights $w_1, \ldots, w_N \in [0, 1]$ (summing to 1) with $w_1 \neq 1$ we have

\[ w_1 p_1 + \cdots + w_N p_N = w_1 p_1 + (1 - w_1)\left[\frac{w_2 p_2 + \cdots + w_N p_N}{1 - w_1}\right] \tag{27.18} \]

and the quantity inside $[\cdots]$ is a convex combination of the $N - 1$ points
$p_2, \ldots, p_N$.

The following result expresses the simultaneous convexity and concavity
of functions whose graphs are straight lines:

Proposition 27.7.1 If $L$ is a function whose graph is a straight line, that
is of the form

\[ L(x) = Mx + k \quad \text{for all } x \in \mathbb{R}, \]

for some constants $M, k \in \mathbb{R}$, then

\[ L(w_1 p_1 + \cdots + w_N p_N) = w_1 L(p_1) + \cdots + w_N L(p_N) \tag{27.19} \]

for all $p_1, \ldots, p_N \in \mathbb{R}$ and $w_1, \ldots, w_N \in \mathbb{R}$ with $w_1 + \cdots + w_N = 1$.

Note that in (27.19) we have just a linear combination

\[ w_1 p_1 + \cdots + w_N p_N \]

and the coefficients $w_j$ need not be in $[0, 1]$, although they do have to sum to 1.

Proof. It is more convenient to start with the right side of (27.19):

\[ \begin{aligned}
w_1 L(p_1) + \cdots + w_N L(p_N) &= w_1(Mp_1 + k) + \cdots + w_N(Mp_N + k) \\
 &= w_1 M p_1 + w_1 k + \cdots + w_N M p_N + w_N k \\
 &= M w_1 p_1 + \cdots + M w_N p_N + (w_1 + \cdots + w_N)k \\
 &= M(w_1 p_1 + \cdots + w_N p_N) + k \quad \text{(because } w_1 + \cdots + w_N = 1\text{)} \\
 &= L(w_1 p_1 + \cdots + w_N p_N).
\end{aligned} \]

Thus $L$ maps linear combinations (with coefficients summing to 1) to linear
combinations. QED
This leads to the following convenient formulation of convexity of a
function $\Phi$:

Proposition 27.7.2 A function $\Phi$ on an interval $U \subset \mathbb{R}$ is convex if and
only if

\[ \Phi(\lambda a + \mu b) \le \lambda\Phi(a) + \mu\Phi(b) \tag{27.20} \]

for all $a, b \in U$ and all weights $\lambda, \mu \in [0, 1]$ with $\lambda + \mu = 1$. The function
$\Phi$ is strictly convex if and only if (27.20) holds for all $a, b, \lambda, \mu$ as above but
with $\le$ replaced by $<$ whenever the three points $a$, $\lambda a + \mu b$ and $b$ are distinct
(no two are equal to each other).

Proof. We can work with $a, b \in U$, with $a < b$ (if $a = b$ then (27.20) is
an equality, both sides being $\Phi(a)$). Let $A$ be the point $(a, \Phi(a))$ and $B$ the
point $(b, \Phi(b))$. The straight line joining $A$ to $B$ has equation

\[ y = L(x) = Mx + k \]

for some constants $M, k$. Consider now any point $p \in [a, b]$; we can write this as

\[ p = \lambda a + \mu b \]

for some $\lambda, \mu \in [0, 1]$ with $\lambda + \mu = 1$ (see (27.17)). The condition that the
graph of $\Phi$ is below the graph of $L$ is

\[ \Phi(p) \le L(p) \]

for all such $p$. Now

\[ L(p) = L(\lambda a + \mu b) = \lambda L(a) + \mu L(b), \]

by Proposition 27.7.1. Since $y = L(x)$ passes through $A$ and $B$, on the graph
$y = \Phi(x)$, we have

\[ L(a) = \Phi(a), \quad \text{and} \quad L(b) = \Phi(b). \tag{27.21} \]

Combining all these observations we have

\[ \Phi(\lambda a + \mu b) \le \lambda L(a) + \mu L(b) = \lambda\Phi(a) + \mu\Phi(b), \]

which establishes (27.20) as being equivalent to the convexity condition for
$\Phi$. For strict convexity, the point $(p, \Phi(p))$ lies strictly below $(p, L(p))$, which
means $\Phi(p) < L(p)$ when $p$ is strictly between $a$ and $b$. Translating from $p$
to $\lambda a + \mu b$, and using again the equalities (27.21), we obtain the condition for
strict convexity of $\Phi$. QED

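It is now easy to raise the inequality (27.20) to an inequality for convex
combinations of multiple points. For example, for points $p_1, p_2, p_3 \in U$ and
weights $w_1, w_2, w_3$ with $w_1 \neq 1$, we have

\[ \begin{aligned}
\Phi(w_1 p_1 + w_2 p_2 + w_3 p_3)
 &= \Phi\left(w_1 p_1 + (1 - w_1)\left[\frac{w_2}{1 - w_1}p_2 + \frac{w_3}{1 - w_1}p_3\right]\right) \\
 &\le w_1\Phi(p_1) + (1 - w_1)\,\Phi\left(\frac{w_2}{1 - w_1}p_2 + \frac{w_3}{1 - w_1}p_3\right) \\
 &\le w_1\Phi(p_1) + (1 - w_1)\left(\frac{w_2}{1 - w_1}\Phi(p_2) + \frac{w_3}{1 - w_1}\Phi(p_3)\right) \\
 &= w_1\Phi(p_1) + w_2\Phi(p_2) + w_3\Phi(p_3).
\end{aligned} \]

This procedure (the method of induction) leads to the conclusion that

\[ \Phi(w_1 p_1 + \cdots + w_N p_N) \le w_1\Phi(p_1) + \cdots + w_N\Phi(p_N) \tag{27.22} \]

for every convex function $\Phi$ on any interval $U$, any $N \in \{1, 2, \ldots\}$, any points
$p_1, \ldots, p_N \in U$, and all weights $w_1, \ldots, w_N \in [0, 1]$ adding up to 1.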
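
The inequality (27.22) can be probed numerically. The sketch below is an added illustration; the function names, the choice of test functions and the random sampling are assumptions for the example. It reports the difference between the right and left sides, which should be non-negative for a convex function.

```python
import math
import random

def convex_combination(weights, values):
    """Weighted sum w_1 v_1 + ... + w_N v_N."""
    return sum(w * v for w, v in zip(weights, values))

def jensen_gap(phi, weights, points):
    """Right side minus left side of (27.22); non-negative when phi is convex."""
    left = phi(convex_combination(weights, points))
    right = convex_combination(weights, [phi(p) for p in points])
    return right - left

def random_weights(n, rng):
    """n non-negative weights that add up to 1."""
    raw = [rng.random() for _ in range(n)]
    total = sum(raw)
    return [r / total for r in raw]

rng = random.Random(0)
for name, phi in [("x^2", lambda x: x * x), ("exp", math.exp), ("|x|", abs)]:
    w = random_weights(4, rng)
    pts = [rng.uniform(-2.0, 2.0) for _ in range(4)]
    print(name, jensen_gap(phi, w, pts))
```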
Exercises on Maxima/Minima, Mean Value Theorem,
Convexity


1. Find the maximum value of $x^{1/x}$ for $x \in (0, \infty)$. Explain your reasoning
fully and present all calculations clearly.


2. Find the distance of the point (1, 2) from the line whose equation is

\[ 3x + 4y - 5 = 0. \]


3. Suppose f is a twice differentiable function on [1,5], with f(1) = f(3) = 
f(5). Show that there is a point p € (1,5) where f”(p) is 0. 


4. Explain briefly why

\[ \log 101 - \log 100 < 0.01. \]


5. Prove the inequality 


for any a,b > 0. 
Chapter 28


L’Hospital’s Rule 


L’Hospital’s rule makes it possible to compute weird limits such as

\[ \lim_{x \to \infty} x^{1/x}, \]

and is worth studying just for that reason. It has been a staple topic in any
introduction to calculus since l’Hospital’s own book, reputed to be the first
textbook on calculus.

Briefly, l’Hospital’s rule says

\[ \lim_{x \to p} \frac{f(x)}{g(x)} = \lim_{x \to p} \frac{f'(x)}{g'(x)} \]

if $f(x)$ and $g(x)$ both go to 0, or both go to $\pm\infty$, as $x \to p$, and if the limit
on the right exists. Assuming that $f$ and $g$ both have domain a set $S$, here
is a more detailed statement of the conditions:

• there is a neighborhood $U$ of $p \in \mathbb{R}^*$ such that the part of $U$ in
  $S$, with $p$ added in if necessary, that is the set $W = (S \cap U) \cup \{p\}$, is
  either $U$ or a one-sided neighborhood of $p$ of the form $(a, p]$ or $[p, a)$,
  for some $a \in \mathbb{R}$;

• $g(x) \neq 0$ and $g'(x) \neq 0$ for all $x \in W$, with $x \neq p$;

• $\lim_{x \to p} f(x)$ and $\lim_{x \to p} g(x)$ are either both 0 or are both in $\{-\infty, \infty\}$;

• the limit $\lim_{x \to p} \frac{f'(x)}{g'(x)}$ exists.

28.1 Examples


Avoiding silly examples such as

\[ \lim_{x \to 1} \frac{x^2 - 1}{x - 1} \]

(which is 2 as can be seen directly from $x^2 - 1 = (x - 1)(x + 1)$) let us work
out the following simple but more interesting use of l’Hospital’s rule:

\[ \lim_{x \to 0} \frac{\sin x - x}{\frac{1}{3!}x^3}. \tag{28.1} \]

The first thing to observe is that both numerator and denominator go to 0
as $x \to 0$. So we could, potentially, use l’Hospital’s rule:

\[ \lim_{x \to 0} \frac{\sin x - x}{\frac{1}{3!}x^3} = \lim_{x \to 0} \frac{\cos x - 1}{\frac{1}{3!}3x^2}, \tag{28.2} \]

provided the limit on the right exists. To deal with this limit observe again
that both numerator and denominator go to 0 as $x \to 0$, and so we could
again try l’Hospital:

\[ \lim_{x \to 0} \frac{\cos x - 1}{\frac{1}{3!}3x^2} = \lim_{x \to 0} \frac{-\sin x}{\frac{1}{3!}3 \cdot 2x} = -\lim_{x \to 0} \frac{\sin x}{x}, \tag{28.3} \]

and we do know that this limit exists and its value is $-1$. Thus l’Hospital’s
rule does imply that the equality (28.3) holds, and this shows that the right
side of (28.2) exists, which then justifies the equality (28.2) by l’Hospital’s
rule. This somewhat convoluted logic is summarized simply in:


\[ \begin{aligned}
\lim_{x \to 0} \frac{\sin x - x}{\frac{1}{3!}x^3}
 &\stackrel{\text{l'H}}{=} \lim_{x \to 0} \frac{\cos x - 1}{\frac{1}{3!}3x^2} && \text{(if the right side exists)} \\
 &\stackrel{\text{l'H}}{=} \lim_{x \to 0} \frac{-\sin x}{\frac{1}{3!}3 \cdot 2x} && \text{(if this exists)} \\
 &= -\lim_{x \to 0} \frac{\sin x}{x} \\
 &= -1 && \text{(which justifies the 2nd and hence the 1st equality above).}
\end{aligned} \tag{28.4} \]

Thus, for $x$ close to 0 we should be able to approximate the difference
$\sin x - x$ by $-\frac{1}{3!}x^3$, and so

\[ \sin x \approx x - \frac{1}{3!}x^3 \]

for $x$ near 0.
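Now we carry this a step beyond, finding an estimate for the difference

\[ \sin x - \left[x - \frac{1}{3!}x^3\right]. \]

By repeated use of l’Hospital’s rule we have:

\[ \begin{aligned}
\lim_{x \to 0} \frac{\sin x - \left[x - \frac{1}{3!}x^3\right]}{\frac{1}{5!}x^5}
 &= \lim_{x \to 0} \frac{\cos x - \left[1 - \frac{1}{3!}3x^2\right]}{\frac{1}{5!}5x^4} && \text{(if the right side exists)} \\
 &= \lim_{x \to 0} \frac{\cos x - \left[1 - \frac{1}{2!}x^2\right]}{\frac{1}{4!}x^4} && \text{(by algebraic simplification)} \\
 &= \lim_{x \to 0} \frac{-\sin x - [-x]}{\frac{1}{4!}4x^3} && \text{(if the right side exists)} \\
 &= -\lim_{x \to 0} \frac{\sin x - x}{\frac{1}{3!}x^3} && \text{(by algebraic simplification)} \\
 &= -(-1) && \text{(by the previous example)} \\
 &= 1.
\end{aligned} \tag{28.5} \]

Each application of l’Hospital’s rule above was a case where the limit of
the ratio had the form 0/0.

We turn now to a different example:

\[ \lim_{x \to \infty} x^{1/x} = \lim_{x \to \infty} \left[e^{\log x}\right]^{1/x}
   = e^{\lim_{x \to \infty} \frac{\log x}{x}} \quad \text{(the exponent here is formally } \infty/\infty\text{)}. \tag{28.6} \]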
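
The limits in this section can be checked by direct evaluation. The sketch below is an added illustration with hypothetical function names; it evaluates the quotient $(\sin x - x)/(\frac{1}{3!}x^3)$, the quotient $(\sin x - [x - \frac{1}{3!}x^3])/(\frac{1}{5!}x^5)$, and $x^{1/x}$ written as $e^{(\log x)/x}$. The sample points are kept moderately small or large because subtracting nearly equal floating-point numbers loses accuracy for very small $x$.

```python
import math

def ratio_cubic(x):
    """(sin x - x) divided by x^3 / 3!."""
    return (math.sin(x) - x) / (x ** 3 / 6.0)

def ratio_quintic(x):
    """(sin x - [x - x^3/3!]) divided by x^5 / 5!."""
    return (math.sin(x) - (x - x ** 3 / 6.0)) / (x ** 5 / 120.0)

def root_power(x):
    """x^(1/x), computed as exp(log(x) / x)."""
    return math.exp(math.log(x) / x)

for x in [0.5, 0.1, 0.01]:
    print(x, ratio_cubic(x), ratio_quintic(x))

for x in [10.0, 1e3, 1e6]:
    print(x, root_power(x))
```

The first two columns approach $-1$ and $1$ respectively as $x$ shrinks, while the values of $x^{1/x}$ drift towards 1 as $x$ grows, which is consistent with $(\log x)/x$ tending to 0.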
28.2 Proving l’Hospital’s rule


The key step in proving l’Hospital’s rule

\[ \lim_{x \to p} \frac{f(x)}{g(x)} = \lim_{x \to p} \frac{f'(x)}{g'(x)}, \]

with $f(x)$ and $g(x)$ both $\to 0$ as $x \to p$, is the observation that

\[ \frac{f(x)}{g(x)} = \frac{f'(c)}{g'(c)}, \]

for some $c$ between $x$ and $p$; when $x \to p$, the point $c$ also $\to p$ and this
shows that the above ratios approach the same limit. This is formalized in
the following version of the mean value theorem:


Proposition 28.2.1 Suppose $F$ and $G$ are continuous functions on a closed
interval $[a, b]$, where $a, b \in \mathbb{R}^*$ and $a < b$, with values in $\mathbb{R}$. Suppose that $F$
and $G$ are differentiable on $(a, b)$, with $G'(x) \neq 0$ for all $x \in (a, b)$. Then

\[ \frac{F(b) - F(a)}{G(b) - G(a)} = \frac{F'(c)}{G'(c)} \tag{28.7} \]

for some $c \in (a, b)$.

Since $G'$ is never 0 on $(a, b)$ it follows by Rolle’s theorem that $G(b) - G(a) \neq 0$.

Proof. Consider the function $H$ defined on $[a, b]$ by

\[ H(x) = [G(b) - G(a)][F(x) - F(a)] - [F(b) - F(a)][G(x) - G(a)]
   \quad \text{for all } x \in [a, b]. \tag{28.8} \]

This is clearly continuous on $[a, b]$ and differentiable on $(a, b)$ with derivative
given by

\[ H'(x) = [G(b) - G(a)]F'(x) - [F(b) - F(a)]G'(x) \quad \text{for all } x \in (a, b). \tag{28.9} \]

Observe also that

\[ H(a) = 0 \quad \text{and} \quad H(b) = 0. \]

Then by Rolle’s theorem applied to $H$ there is a point $c \in (a, b)$ where $H'(c)$
is 0; this means

\[ [G(b) - G(a)]F'(c) - [F(b) - F(a)]G'(c) = 0. \tag{28.10} \]

Thus

\[ [F(b) - F(a)]G'(c) = [G(b) - G(a)]F'(c), \]

which implies the result (28.7). QED
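
For concrete functions the point $c$ of Proposition 28.2.1 can be located numerically. The sketch below is an added illustration: it applies bisection to $H'$ from (28.9), which works only when $H'$ changes sign between the endpoints (true for the example chosen, not guaranteed in general). The function name `cauchy_mvt_point` and the example $F(x) = x^3$, $G(x) = x^2$ on $[1, 2]$ are assumptions for this sketch.

```python
def cauchy_mvt_point(F, G, dF, dG, a, b, iterations=200):
    """Find c in (a, b) with [G(b)-G(a)] F'(c) = [F(b)-F(a)] G'(c) by bisection on H'."""
    dH = lambda x: (G(b) - G(a)) * dF(x) - (F(b) - F(a)) * dG(x)
    lo, hi = a, b
    if dH(lo) * dH(hi) > 0:
        raise ValueError("H' does not change sign at the endpoints; bisection not applicable")
    for _ in range(iterations):
        mid = (lo + hi) / 2.0
        if dH(lo) * dH(mid) <= 0:
            hi = mid            # a sign change (or zero) lies in [lo, mid]
        else:
            lo = mid            # otherwise it lies in [mid, hi]
    return (lo + hi) / 2.0

F = lambda x: x ** 3
G = lambda x: x ** 2
dF = lambda x: 3 * x ** 2
dG = lambda x: 2 * x

c = cauchy_mvt_point(F, G, dF, dG, 1.0, 2.0)
print("c =", c)
print("secant ratio :", (F(2.0) - F(1.0)) / (G(2.0) - G(1.0)))
print("F'(c)/G'(c)  :", dF(c) / dG(c))
```

Here the secant ratio is $7/3$ and $F'(c)/G'(c) = 3c/2$, so the point found should be close to $c = 14/9$.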
Now we can prove one form of |’Hospital’s rule: 


Proposition 28.2.2 Suppose $f$ and $g$ are differentiable functions on an
interval $U \subset \mathbb{R}^*$, with $g(x) \neq 0$ and $g'(x) \neq 0$ for all $x \in U$ in some
neighborhood of a limit point $p \in \mathbb{R}^*$ of $U$, and suppose

\[ \lim_{x \to p} f(x) = 0 \quad \text{and} \quad \lim_{x \to p} g(x) = 0. \]

Then

\[ \lim_{x \to p} \frac{f(x)}{g(x)} = \lim_{x \to p} \frac{f'(x)}{g'(x)} \tag{28.11} \]

if the limit on the right in (28.11) exists.


Proof. Let $W$ be a closed interval, one of whose endpoints is $p$ and for which
all points of $W$, except possibly for $p$, lie inside $U$. Define $F$ and $G$ on $U \cup \{p\}$
by requiring that $F(x) = f(x)$ and $G(x) = g(x)$ for all $x \in U$ with $x \neq p$,
and setting
\[
F(p) = 0 \quad\text{and}\quad G(p) = 0.
\]
What this does is make $f$ and $g$ defined and continuous at $p$ in case they weren't
to start with; more precisely, $F$ and $G$ are continuous on $U \cup \{p\}$. For any
$x \in U$, in a neighborhood of $p$, with $x \neq p$ we have
\[
\frac{f(x)}{g(x)} = \frac{F(x)-F(p)}{G(x)-G(p)},
\]
because $F(p)$ and $G(p)$ are both 0 and $F(x) = f(x)$ and $G(x) = g(x)$. Then,
for such $x$, we have by Proposition 28.2.1
\[
\frac{f(x)}{g(x)} = \frac{F'(c_x)}{G'(c_x)} = \frac{f'(c_x)}{g'(c_x)},
\]
for some $c_x$ strictly between $x$ and $p$. Letting $x \to p$ makes $c_x \to p$ and so
\[
\lim_{x\to p}\frac{f(x)}{g(x)} = \lim_{x\to p}\frac{f'(c_x)}{g'(c_x)} = \lim_{w\to p}\frac{f'(w)}{g'(w)},
\]
since we assume that the right side here exists. QED
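
As a rough numerical illustration of Proposition 28.2.2 (an illustration only, not part of the proof), the sketch below compares $f(x)/g(x)$ with $f'(x)/g'(x)$ for $f = \sin$ and $g(x) = x$ as $x$ approaches $p = 0$. The helper name `ratio_table` and the sample points $p + 10^{-k}$ are choices made here for the example and do not come from the notes.

```python
import math

def ratio_table(f, g, df, dg, p, steps=6):
    """Return rows (x, f(x)/g(x), f'(x)/g'(x)) at x = p + 10**-k for k = 1..steps."""
    rows = []
    for k in range(1, steps + 1):
        x = p + 10.0 ** (-k)
        rows.append((x, f(x) / g(x), df(x) / dg(x)))
    return rows

# f(x) = sin x and g(x) = x both tend to 0 as x -> 0 (the 0/0 case).
rows = ratio_table(math.sin, lambda x: x, math.cos, lambda x: 1.0, p=0.0)
for x, fg, dfdg in rows:
    print(f"x = {x:.0e}   f/g = {fg:.12f}   f'/g' = {dfdg:.12f}")
```

Both columns settle toward the same value, which is what the proposition predicts when the limit of $f'/g'$ exists.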
We turn now to the case where f and g have infinite limits: 


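Proposition 28.2.3 Suppose $f$ and $g$ are differentiable functions on an
interval $U \subset \mathbb{R}^*$, with $g(x) \neq 0$ and $g'(x) \neq 0$ for all $x \in U$ in some
neighborhood of a limit point $p \in \mathbb{R}^*$ of $U$, and suppose
\[
\lim_{x\to p} f(x) \in \{-\infty, \infty\} \quad\text{and}\quad \lim_{x\to p} g(x) \in \{-\infty, \infty\}.
\]
Then
\[
\lim_{x\to p}\frac{f(x)}{g(x)} = \lim_{x\to p}\frac{f'(x)}{g'(x)} \tag{28.12}
\]
if the limit on the right in (28.12) exists.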


Proof. From
\[
\frac{f(x)-f(a)}{g(x)-g(a)} = \frac{f(x)}{g(x)} \cdot \frac{1-\frac{f(a)}{f(x)}}{1-\frac{g(a)}{g(x)}} \tag{28.13}
\]
we see that since both $f(x)$ and $g(x)$ approach $\pm\infty$ as $x \to p$, the limiting
behavior of $f(x)/g(x)$ and $\frac{f(x)-f(a)}{g(x)-g(a)}$ is the same. This is the motivation for
the strategy we use.

Let
\[
L = \lim_{x\to p}\frac{f'(x)}{g'(x)},
\]
and $W$ any neighborhood of $L$. Choose a smaller neighborhood $W_1$ of $L$ such
that
\[
cW_1 \subset W \quad\text{for all } c \in (1/2, 2). \tag{28.14}
\]
There is an interval $V$ that is a neighborhood of $p$ such that
\[
\frac{f'(x)}{g'(x)} \in W_1 \tag{28.15}
\]
for all $x \in V \cap U$ and $x \neq p$. Pick any two distinct points $a, x \in V \cap U$,
neither equal to $p$; then
\[
\frac{f(x)-f(a)}{g(x)-g(a)} = \frac{f'(c)}{g'(c)}
\]
for some $c$ between $a$ and $x$, and hence in $V$ (note that $g(x) \neq g(a)$ because
by Rolle's theorem if $g(x) = g(a)$ then $g'$ would be 0 at a point between $a$
and $x$, contrary to the given assumption that $g'$ is never 0 on $U$). Hence by
(28.15) we have
\[
\frac{f(x)-f(a)}{g(x)-g(a)} \in W_1,
\]
for distinct $a, x \in V \cap U$, neither equal to $p$. Since
\[
\lim_{x\to p} \frac{1-\frac{g(a)}{g(x)}}{1-\frac{f(a)}{f(x)}} = 1,
\]
there is a neighborhood $U_0$ of $p$ such that
\[
\frac{1-\frac{g(a)}{g(x)}}{1-\frac{f(a)}{f(x)}} \in (1/2, 2) \quad\text{for all } x \in U_0 \cap U.
\]
From (28.13) we have
\[
\frac{f(x)}{g(x)} = \left[\frac{f(x)-f(a)}{g(x)-g(a)}\right] \cdot \frac{1-\frac{g(a)}{g(x)}}{1-\frac{f(a)}{f(x)}} \in W
\]
for all $x \in U_0 \cap U \cap V$. Since this is true for any neighborhood $W$ of $L$, we
conclude that $f(x)/g(x) \to L$ as $x \to p$. QED

Exercises on l'Hospital's rule


1. Work out the limit 
cos x — [1 — 427] 


lim 
x0 ae? 


clearly justifying each step. 


2. Suppose $g'$ is continuous, $g(2) = 0$ and $g'(2) = 1$. Work out the limit
\[
\lim_{w\to 0} \frac{g(2+w) + g(2+3w)}{w}
\]
clearly justifying each step.
3. Find 


Chapter 29 


Integration 


The development of calculus has two original themes: (i) the notion of tan- 
gent to a curve, (ii) computing areas of curved regions. The former leads 
to differential calculus, and the latter to integral calculus, to which we turn 
now. 


29.1 From areas to integrals 


The classical idea of area $A$ of a region $S$ enclosed by a curve is that $A$ should
be $\le$ the sum of areas of any finite collection of squares that cover the region $S$
and $A$ should be $\ge$ the sum of areas of any finite collection of squares inside
$S$. This simple and perfectly natural idea fails to produce a completely
satisfactory and usable measure of area when the region $S$ is ‘unintuitive’
(for example $S$ consists of all points $(x,y) \in \mathbb{R}^2$ with irrational coordinates),
but it is meaningful, intuitive and computable for regions bounded by well
behaved curves.
We are mainly interested in the area of the region lying below a graph
\[
y = f(x),
\]
and above the $x$-axis, for $x \in [a,b]$. See Figure 29.1. For this discussion we
assume $f \ge 0$. Suppose $A$ is the area of this region, a notion we will pin
down as the discussion progresses.

An overestimate of $A$ will surely be obtained by the sum of areas of ‘upper
rectangles’ obtained by slicing the region into vertical pieces. More precisely,
partition $[a,b]$ into $N$ intervals marked off by
\[
a = t_0 < t_1 < \cdots < t_N = b.
\]
By the $k$-th ‘upper rectangle’ we mean the rectangle whose base runs along
the $x$-axis from $x = t_{k-1}$ to $x = t_k$ and whose height is
\[
M_k = \sup_{x\in[t_{k-1},t_k]} f(x). \tag{29.1}
\]
The area of this upper rectangle is
\[
M_k(t_k - t_{k-1}).
\]
The width $t_k - t_{k-1}$ is often denoted by $\Delta t_k$:
\[
\Delta t_k = t_k - t_{k-1}. \tag{29.2}
\]
Thus the area of the $k$-th upper rectangle is
\[
M_k\Delta t_k.
\]
The sum of the areas of all the upper rectangles is
\[
\sum_{k=1}^{N} M_k\Delta t_k = M_1\Delta t_1 + \cdots + M_N\Delta t_N.
\]
Thus the upper rectangles provide an overestimate of the area $A$:
\[
A \le \sum_{k=1}^{N} M_k\Delta t_k.
\]
Similarly, working with lower rectangles we have an underestimate of the
area:
\[
\sum_{k=1}^{N} m_k\Delta t_k \le A,
\]
where
\[
m_k = \inf_{x\in[t_{k-1},t_k]} f(x). \tag{29.3}
\]
Thus the actual area $A$ lies between the overestimates given by the upper
sums and underestimates given by the lower sums:
\[
\sum_{k=1}^{N} m_k\Delta t_k \le A \le \sum_{k=1}^{N} M_k\Delta t_k.
\]
Surely, the area $A$ is the unique value that lies between all the upper sums
and all the lower sums. We can take this as the definition of area under a graph.

[Figure 29.1: Area below $y = f(x)$, over the interval from $a$ to $b$.]

[Figure 29.2: Area below $y = f(x)$ overestimated by upper rectangles. With $a = t_0 < t_1 < t_2 < t_3 = b$ and $M_2 = \sup_{x\in[t_1,t_2]} f(x)$, the area under the graph is $\le M_1\Delta t_1 + M_2\Delta t_2 + M_3\Delta t_3$, the sum of upper rectangle areas.]
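
The upper and lower rectangle sums described above can be sketched in code. The example below is a minimal illustration under a simplifying assumption that is not in the text: $f$ is increasing, so that the sup over $[t_{k-1}, t_k]$ is $f(t_k)$ and the inf is $f(t_{k-1})$. The function names `uniform_partition` and `upper_lower_sums` are hypothetical, chosen only for this sketch; exact rational arithmetic is used so that no rounding enters.

```python
from fractions import Fraction

def uniform_partition(a, b, n):
    """Return the points t_0 < t_1 < ... < t_n dividing [a, b] into n equal parts."""
    a, b = Fraction(a), Fraction(b)
    return [a + (b - a) * k / n for k in range(n + 1)]

def upper_lower_sums(f, partition):
    """Upper and lower rectangle sums for an INCREASING function f.

    Assumption: f is increasing, so M_k = f(t_k) and m_k = f(t_{k-1}).
    """
    upper = Fraction(0)
    lower = Fraction(0)
    for left, right in zip(partition, partition[1:]):
        width = right - left          # this is Delta t_k
        upper += f(right) * width     # M_k * Delta t_k
        lower += f(left) * width      # m_k * Delta t_k
    return upper, lower

def square(x):
    return x * x

upper, lower = upper_lower_sums(square, uniform_partition(0, 1, 4))
print("lower sum:", lower, " upper sum:", upper)
```

For $f(x) = x^2$ on $[0,1]$ with four equal intervals, the two sums bracket the value $1/3$, which is the area under this graph.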


29.2 The Riemann integral 


The ideas of the preceding section lead to the crucial notion of the integral 
of a function. Let us first extract some helpful terminology from our earlier 
discussions. 

A partition $P$ of an interval $[a,b]$, with $a,b \in \mathbb{R}$ and $a < b$, is a finite
subset of $[a,b]$ containing both the end points $a$ and $b$. Typically we denote
a partition by
\[
P = \{x_0, x_1, \ldots, x_N\},
\]
where
\[
a = x_0 < x_1 < \cdots < x_N = b.
\]


236 Ambar N. Sengupta 11/6/2011 


The width of the $k$-th interval is denoted
\[
\Delta x_k = x_k - x_{k-1}. \tag{29.5}
\]
For a function
\[
f : [a,b] \to \mathbb{R},
\]
and the partition $P$, the upper sum is
\[
U(f,P) = \sum_{k=1}^{N} M_k(f)\Delta x_k, \tag{29.6}
\]
and the lower sum is
\[
L(f,P) = \sum_{k=1}^{N} m_k(f)\Delta x_k, \tag{29.7}
\]
where
\[
M_k(f) = \sup_{x\in[x_{k-1},x_k]} f(x), \qquad
m_k(f) = \inf_{x\in[x_{k-1},x_k]} f(x). \tag{29.8}
\]


In the degenerate case where $b = a$, the only partition of $[a,a]$ is just the
one-point set $\{a\}$, and the upper and lower sums are taken to be 0.
If there is a unique value $A$ for which
\[
L(f,P) \le A \le U(f,P) \tag{29.9}
\]
for every partition $P$ of $[a,b]$, then $A$ is called the Riemann integral of $f$, and
denoted
\[
\int_a^b f.
\]
We will refer to this simply as the integral of $f$ from $a$ to $b$ or over $[a,b]$.
We say that $f$ is integrable if $\int_a^b f$ exists and is finite (in $\mathbb{R}$).
The definition of the integral here is in the same spirit as that of the concept
of limit back in (6.1) and the concept of tangent line in (13.1).

From (29.9) we see that an approximation to $\int_a^b f$ is given by
\[
\int_a^b f(x)\,dx \approx \sum_{k=1}^{N} f(x_k^*)\Delta x_k, \tag{29.10}
\]
where $x_k^*$ is any point in $[x_{k-1},x_k]$, for each $k \in \{1,\ldots,N\}$. The sum on the
right in (29.10) is called a Riemann sum for $f$ with respect to the partition
$P$. The relation (29.10) suggests the historical origin of the notation $\int$ for
integration.

Note that if $\sup_{x\in[a,b]} f(x)$ is $\infty$ then at least one $M_k(f)$ is $\infty$, for any
partition $P$, and so the upper sums are all $\infty$, and in this case the integral
$\int_a^b f$, if it exists, must also be $\infty$. Similarly, if $\inf_{x\in[a,b]} f(x)$ is $-\infty$ then $\int_a^b f$,
if it exists, is $-\infty$.

Thus, if $f$ is integrable then it is bounded, in the sense that both its
supremum and its infimum are finite.

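If $f$ is constant, with value $C \in \mathbb{R}$ then, working with any partition $P$ as
above, we have
\[
M_k(f) = C \quad\text{and}\quad m_k(f) = C,
\]
for all $k \in \{1,\ldots,N\}$, and so
\[
U(f,P) = C\Delta x_1 + \cdots + C\Delta x_N = C(\Delta x_1 + \cdots + \Delta x_N) = C(b-a)
\]
and
\[
L(f,P) = C\Delta x_1 + \cdots + C\Delta x_N = C(\Delta x_1 + \cdots + \Delta x_N) = C(b-a).
\]
Hence
\[
\int_a^b C = C(b-a). \tag{29.11}
\]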


29.3 Refining partitions 


Consider a function $f$ on an interval $[a,b]$, and a partition $P = \{x_0,\ldots,x_N\}$
of $[a,b]$, with
\[
a = x_0 < x_1 < \cdots < x_N = b.
\]
Let us see what effect there is on the upper and lower sums when points are
added to $P$ to make it a finer partition of $[a,b]$. Let us start by adding one
point $s \in (x_{j-1},x_j)$ to the $j$-th interval.

In the sum
\[
U(f,P) = \sum_{k=1}^{N} M_k(f)\Delta x_k
\]
all terms remain the same except for the $j$-th term: for this
\[
M_j(f)\Delta x_j
\]
is replaced by
\[
A(s - x_{j-1}) + B(x_j - s),
\]
where $A$ is the sup of $f$ over $[x_{j-1},s]$ and $B$ is the sup of $f$ over $[s,x_j]$:
\[
A = \sup_{x\in[x_{j-1},s]} f(x) \quad\text{and}\quad B = \sup_{x\in[s,x_j]} f(x).
\]
Clearly these are both $\le M_j(f)$:
\[
A, B \le M_j(f),
\]
and in fact at least one of them is equal to $M_j(f)$. Hence
\[
A(s - x_{j-1}) + B(x_j - s) \le M_j(f)(s - x_{j-1}) + M_j(f)(x_j - s),
\]
and observe that the right side here adds up to $M_j(f)\Delta x_j$; thus:
\[
A(s - x_{j-1}) + B(x_j - s) \le M_j(f)\Delta x_j.
\]
Looking back at the upper sum $U(f,P)$ we conclude that
\[
U(f,P_1) \le U(f,P),
\]
where $P_1$ is the partition obtained by adding the point $s$ to $P$:
\[
P_1 = P \cup \{s\}.
\]
Thus, adding a point to a partition (that is, splitting one of the intervals into
two) lowers the upper sum.
A similar argument shows that
\[
L(f,P_1) \ge L(f,P);
\]
adding a point to a partition raises the lower sum.

Adding points one by one enlarges a given partition $P$ to any given larger
partition $P'$, and at each stage in this process the upper sum is lowered and
the lower sum is raised:

Proposition 29.3.1 Let $f : [a,b] \to \mathbb{R}$ be a function, where $a,b \in \mathbb{R}$ and
$a < b$, and $P$ and $P'$ any partitions of $[a,b]$ with $P \subset P'$; then
\[
L(f,P) \le L(f,P'), \qquad U(f,P') \le U(f,P). \tag{29.12}
\]

This implies the following natural but strong observation:

Proposition 29.3.2 Let $f : [a,b] \to \mathbb{R}$ be a function, where $a,b \in \mathbb{R}$ and
$a < b$, and $P$ and $Q$ any partitions of $[a,b]$; then
\[
L(f,P) \le U(f,Q). \tag{29.13}
\]
Thus, every upper sum of $f$ is $\ge$ every lower sum of $f$.

We have seen something similar in our study of limits back in (6.14).
Proof. Let
\[
P' = P \cup Q.
\]
Then $P'$ contains both $P$ and $Q$, and so by Proposition 29.3.1 we have
\[
L(f,P) \le L(f,P') \quad\text{and}\quad U(f,P') \le U(f,Q).
\]
Combining this with the fact that $L(f,P') \le U(f,P')$ produces the inequality
(29.13). QED
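
The behavior of the sums under refinement can be checked on a small example. The sketch below again assumes an increasing function (here $f(x) = x^2$ on $[0,1]$), so that sups and infs sit at interval endpoints; this assumption, the two sample partitions `P` and `Q`, and the helper names are choices made for illustration only.

```python
from fractions import Fraction

def upper_sum(f, partition):
    """U(f, P) for an INCREASING f: the sup on each interval is at its right end."""
    return sum(f(r) * (r - l) for l, r in zip(partition, partition[1:]))

def lower_sum(f, partition):
    """L(f, P) for an INCREASING f: the inf on each interval is at its left end."""
    return sum(f(l) * (r - l) for l, r in zip(partition, partition[1:]))

def f(x):
    return x * x

P = [Fraction(0), Fraction(1, 2), Fraction(1)]
Q = [Fraction(0), Fraction(1, 3), Fraction(2, 3), Fraction(1)]
P_prime = sorted(set(P) | set(Q))   # common refinement P' = P union Q

print("L(f,P)  =", lower_sum(f, P), "  U(f,P)  =", upper_sum(f, P))
print("L(f,P') =", lower_sum(f, P_prime), "  U(f,P') =", upper_sum(f, P_prime))
print("L(f,Q)  =", lower_sum(f, Q), "  U(f,Q)  =", upper_sum(f, Q))
```

Passing from `P` (or `Q`) to the common refinement raises the lower sum and lowers the upper sum, and the lower sum over `P` stays below the upper sum over `Q`, as in (29.12) and (29.13).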


29.4 Estimating approximation error 


Consider a function $f$ on an interval $[a,b] \subset \mathbb{R}$, and let $P = \{x_0,\ldots,x_N\}$ be
a partition of $[a,b]$ with
\[
a = x_0 < x_1 < \cdots < x_N = b.
\]
We know that the integral of $f$, if it exists, lies between the upper sum
$U(f,P)$ and the lower sum $L(f,P)$. So if $U(f,P)$ and $L(f,P)$ are close to
each other then either of these sums would be a good approximation to the
value of the integral. Let us find how far from each other the upper and
lower sums are:
\[
U(f,P) - L(f,P) = \sum_{k=1}^{N} [M_k(f) - m_k(f)]\Delta x_k, \tag{29.14}
\]
where $M_k(f)$ is the sup of $f$ over $[x_{k-1},x_k]$ and $m_k(f)$ is the inf of $f$ over
$[x_{k-1},x_k]$. Thus
\[
M_k(f) - m_k(f) = \text{the fluctuation of $f$ over } [x_{k-1},x_k]. \tag{29.15}
\]
Thus if we can partition $[a,b]$ so finely that the fluctuation of $f$ over each
interval $[x_{k-1},x_k]$ is $< 0.01$ then
\[
U(f,P) - L(f,P) < 0.01\,\Delta x_1 + \cdots + 0.01\,\Delta x_N
= 0.01\,[\Delta x_1 + \cdots + \Delta x_N] = (0.01)(b-a). \tag{29.16}
\]
Thus we can shrink $U(f,P) - L(f,P)$ down by choosing the partition $P$ such
that the fluctuation of $f$ over each interval $[x_{k-1},x_k]$ is very small.
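
The estimate (29.16) can be tried out on a concrete case. The sketch below takes $f(x) = x^2$ on $[0,1]$ with 200 equal intervals; it assumes $f$ is increasing, so the fluctuation over $[x_{k-1},x_k]$ is simply $f(x_k) - f(x_{k-1})$. The choice $N = 200$ and the names `fluctuations` and `gap` are illustrative assumptions, not part of the notes.

```python
from fractions import Fraction

def fluctuations(f, partition):
    """Fluctuation M_k - m_k on each interval, for an INCREASING function f."""
    return [f(r) - f(l) for l, r in zip(partition, partition[1:])]

def gap(f, partition):
    """U(f,P) - L(f,P), computed as in (29.14): sum of fluctuation times width."""
    return sum((f(r) - f(l)) * (r - l) for l, r in zip(partition, partition[1:]))

def f(x):
    return x * x

N = 200
P = [Fraction(k, N) for k in range(N + 1)]   # partition of [0, 1]

max_fluctuation = max(fluctuations(f, P))
gap_value = gap(f, P)
print("largest fluctuation:", max_fluctuation)
print("U - L:", gap_value)
```

Here every fluctuation is below 0.01, and the gap between the upper and lower sums is accordingly below $(0.01)(b-a) = 0.01$.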
29.5 Continuous functions are integrable 


The discussions of the preceding section lead to the following important re- 
sult: 


Theorem 29.5.1 If $f : [a,b] \to \mathbb{R}$ is continuous, where $a,b \in \mathbb{R}$ with $a < b$,
then $f$ is integrable.


The key to proving this result is the ability to partition $[a,b]$ in such a
way that the fluctuations of $f$ over the intervals are all very small:


Proposition 29.5.1 If $f : [a,b] \to \mathbb{R}$ is continuous, where $a,b \in \mathbb{R}$ with
$a < b$, then for any $\epsilon > 0$ there is a partition $P = \{x_0,\ldots,x_N\}$ of $[a,b]$, with
$a = x_0 < \cdots < x_N = b$, such that the fluctuation of $f$ over each interval
$[x_{k-1},x_k]$ is $< \epsilon$.


This property is called uniform continuity of $f$.
Proof. We work with $a < b$, as the case $a = b$ is trivial. Take
\[
x_0 = a.
\]
Since $f$ is continuous at $a$ there is some $x_1 > a = x_0$ such that
\[
f(x_0) - \frac{\epsilon}{4} < f(x) < f(x_0) + \frac{\epsilon}{4}
\]
for all $x \in [x_0,x_1]$. We can clearly take $x_1 \le b$, as there is no need to go
beyond $b$. Thus
\[
\sup_{x\in[x_0,x_1]} f(x) \le f(x_0) + \frac{\epsilon}{4}
\]
and
\[
\inf_{x\in[x_0,x_1]} f(x) \ge f(x_0) - \frac{\epsilon}{4}.
\]
These conditions imply that the fluctuation of $f$ over $[x_0,x_1]$ is $\le \epsilon/2$, which
is, of course, $< \epsilon$.

Now we can start at $x_1$, if it isn't already $b$, and produce a point $x_2 > x_1$
for which
\[
f(x_1) - \frac{\epsilon}{4} < f(x) < f(x_1) + \frac{\epsilon}{4}
\]
for all $x \in [x_1,x_2]$. Again, we can take $x_2 \le b$. It might seem that in this
way we could produce the desired partition $P$. But there could be a problem:
the process might continue infinitely without reaching $b$. Fortunately, we can
show that this will not happen.

Suppose $s$ is the supremum of all $t \in [a,b]$ such that $[a,t]$ has a partition
$P_0 = \{x_0,\ldots,x_K\}$ for which the fluctuation of $f$ over every interval of the
partition is $< \epsilon$. Note that $s > a$. By continuity of $f$ at $s$ there is an interval
$(p,q)$, centered at $s$, such that the fluctuation of $f$ over $(p,q) \cap [a,b]$ is $< \epsilon$.
Pick any point $t \in (p,s)$, with $t > a$; then since $t < s$ the definition of $s$
implies that there is a partition
\[
P_0 = \{x_0,\ldots,x_K\}
\]
of $[a,t]$ such that the fluctuation of $f$ over each interval $[x_{j-1},x_j]$ is $< \epsilon$. Now
pick any point $r \in [s,q) \cap [a,b]$ and set
\[
x_{K+1} = r.
\]
Since the fluctuation of $f$ over $(p,q)$ is $< \epsilon$, the fluctuation of $f$ over the
subinterval $[t,r]$ is $< \epsilon$. Thus we have produced a point $r$, which is $\ge s$,
such that there is a partition $P = \{x_0,\ldots,x_{K+1}\}$ of $[a,r]$ for which the
fluctuations of $f$ are all $< \epsilon$. To avoid a contradiction with the definition of
$s$, we must have $s = b$ (for otherwise, if $s < b$, we could have chosen $r$ to be
$> s$) and the partition $P$ has the desired fluctuation property. QED


Now we can prove Theorem 29.5.1.


Proof of Theorem 29.5.1. All we need to do is show that for any $\epsilon > 0$ there
is a partition $P$ of $[a,b]$ for which $U(f,P) - L(f,P)$ is $< \epsilon$; this will imply that
there is a unique value that lies between all the upper sums and all the lower
sums. Let $\epsilon > 0$ and set, for convenience,
\[
\epsilon_1 = \frac{\epsilon}{b-a}.
\]
By Proposition 29.5.1 there is a partition
\[
P = \{x_0,\ldots,x_N\}
\]
of $[a,b]$ such that
\[
a = x_0 < \cdots < x_N = b
\]
and
\[
M_k(f) - m_k(f) < \epsilon_1
\]
for all $k \in \{1,\ldots,N\}$. Then, by the argument used before in (29.16), we
have
\[
U(f,P) - L(f,P) < \epsilon_1(b-a).
\]
Our choice of $\epsilon_1$ then means that $U(f,P) - L(f,P)$ is $< \epsilon$. QED


29.6 A function for which the integral does 
not exist 


Consider the indicator function $1_{\mathbb{Q}}$ of the rationals:
\[
1_{\mathbb{Q}}(x) = \begin{cases} 1 & \text{if } x \in \mathbb{Q}; \\ 0 & \text{if } x \notin \mathbb{Q}. \end{cases} \tag{29.17}
\]
We focus on the restriction of $1_{\mathbb{Q}}$ over any interval $[a,b] \subset \mathbb{R}$, with $a < b$.
Let $P = \{x_0,\ldots,x_N\}$ be any partition of $[a,b]$, with
\[
a = x_0 < x_1 < \cdots < x_N = b.
\]
Observe that $1_{\mathbb{Q}}$ attains both the value 1 and the value 0 on each interval
$[x_{k-1},x_k]$ (because every such interval contains both rational and irrational
points). Hence
\[
M_k(1_{\mathbb{Q}}) = \sup_{x\in[x_{k-1},x_k]} 1_{\mathbb{Q}}(x) = 1 \quad\text{and}\quad
m_k(1_{\mathbb{Q}}) = \inf_{x\in[x_{k-1},x_k]} 1_{\mathbb{Q}}(x) = 0.
\]
This makes the upper sum large:
\[
U(1_{\mathbb{Q}},P) = 1\cdot\Delta x_1 + \cdots + 1\cdot\Delta x_N = b - a,
\]
and the lower sum small:
\[
L(1_{\mathbb{Q}},P) = 0\cdot\Delta x_1 + \cdots + 0\cdot\Delta x_N = 0.
\]
There certainly are many real numbers lying between 0 and $b - a$, and so
there is no unique such choice. Hence,
\[
\int_a^b 1_{\mathbb{Q}} \quad\text{does not exist.}
\]


29.7 Basic properties of the integral 
Integration of a larger function produces a larger number: 


Proposition 29.7.1 If $f$ and $g$ are functions on an interval $[a,b]$, where
$a,b \in \mathbb{R}$ and $a < b$, for which the integrals $\int_a^b f$ and $\int_a^b g$ exist, and if $f \le g$
then
\[
\int_a^b f \le \int_a^b g.
\]

Proof. Suppose $\int_a^b f > \int_a^b g$. Since $\int_a^b g$ is the unique value lying between
$L(g,P)$ and $U(g,P)$ for all partitions $P$ of $[a,b]$, there must be a partition $P$
of $[a,b]$ such that
\[
\int_a^b f > U(g,P).
\]
Again, since $\int_a^b f$ is the unique value lying between all upper and lower sums
for $f$ there is a partition $Q$ of $[a,b]$ for which
\[
L(f,Q) > U(g,P). \tag{29.18}
\]
Let
\[
P' = P \cup Q,
\]
which is a partition of $[a,b]$. Since $Q \subset P'$ and $f \le g$ we have
\[
L(f,Q) \le L(g,Q) \le L(g,P').
\]
Since $P \subset P'$ we also have
\[
U(g,P') \le U(g,P).
\]
Combining these inequalities with $L(g,P') \le U(g,P')$ produces
\[
L(f,Q) \le U(g,P),
\]
contradicting (29.18). QED
Next we have linearity of the integral: 


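Proposition 29.7.2 Suppose $f$ and $g$ are functions on an interval $[a,b]$,
where $a,b \in \mathbb{R}$ and $a < b$, for which the integrals $\int_a^b f$ and $\int_a^b g$ exist. Assume
also that the sum
\[
\int_a^b f + \int_a^b g
\]
is defined (thus, not $\infty + (-\infty)$ or $(-\infty) + \infty$). Then the integral $\int_a^b (f+g)$ exists
and
\[
\int_a^b (f+g) = \int_a^b f + \int_a^b g. \tag{29.19}
\]
Moreover, for any $k \in \mathbb{R}$ the integral $\int_a^b kf$ of the function $kf$ also exists and
\[
\int_a^b kf = k\int_a^b f. \tag{29.20}
\]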


If a function is integrable over an interval then it is integrable over any
subinterval:


Proposition 29.7.3 Suppose $f$ is integrable over $[a,b]$, where $a,b \in \mathbb{R}$ and
$a < b$. Then $f$ is integrable over $[c,d]$ for any $c,d \in [a,b]$ with $c < d$. Moreover,
\[
\int_a^b f = \int_a^c f + \int_c^b f. \tag{29.21}
\]


Chapter 30 


The Fundamental Theorem of 
Calculus 


The fundamental theorem of calculus connects differential calculus and
integral calculus, and makes it possible to compute integrals by running the
derivative process in reverse.


30.1 Fundamental theorem of calculus 


Here is one form of the fundamental theorem: 


Theorem 30.1.1 Suppose $f$ is an integrable function on $[a,b]$, where $a,b \in \mathbb{R}$,
with $a < b$. Then the function $F$ defined on $[a,b]$ by
\[
F(x) = \int_a^x f \quad\text{for all } x \in [a,b]
\]
is differentiable at $p \in [a,b]$ if $f$ is continuous at $p$, and $F'(p)$ is $f(p)$. Thus,
if $f$ is continuous on $[a,b]$ then
\[
\frac{d}{dx}\int_a^x f = f(x) \tag{30.1}
\]
for all $x \in [a,b]$.


Notice that (30.1) implies that any continuous function f on an interval 
is the derivative of some function F on that interval. 
Here is another version of the result:

Theorem 30.1.2 Suppose $g$ is a differentiable function on $[a,b]$, where $a,b \in \mathbb{R}$,
with $a < b$, for which $g'$ is continuous. Then
\[
\int_a^b g' = g\Big|_a^b, \tag{30.2}
\]
where $g\big|_a^b$ means $g(b) - g(a)$:
\[
g\Big|_a^b = g(b) - g(a).
\]


30.2 Differentials and integrals 


Consider a function f, differentiable at a point p € R. The tangent line to 
the graph y = f(x) through the point (p, f(p)) has slope f’(p). 


[Figure 30.1: The differential $df$ at $p$, showing the graph $y = f(x)$, its tangent line, and $df|_p(h) = f'(p)h$.]


The differential of $f$ at the point $p$ is the function which takes as input
any horizontal step $h$ and outputs the corresponding vertical rise
\[
f'(p)h
\]
if one follows the tangent line to $y = f(x)$ at the point $(p, f(p))$. The differential of
$f$ at $p$ is denoted $df|_p$ and its value on any $h \in \mathbb{R}$ is thus
\[
df|_p(h) = f'(p)h. \tag{30.3}
\]
For example, for the function sin we have
\[
d\sin|_{\pi}(3) = \cos(\pi)\cdot 3 = -3.
\]
The function $x$, that is the function whose graph is $y = x$, has the value $p$ at
any point $p$:
\[
x(p) = p \quad\text{for all } p \in \mathbb{R}.
\]
The slope of $y = x$ is 1 everywhere and so
\[
dx|_p(h) = 1\cdot h = h.
\]
A differential form $\Phi$ is a function that associates to each point $p$ in some
domain set $S \subset \mathbb{R}$ a linear function $\Phi|_p$ on $\mathbb{R}$, that is
\[
\Phi|_p(h) = K_p h \quad\text{for all } h \in \mathbb{R},
\]
for some $K_p \in \mathbb{R}$.
Thus, for a differentiable function $f$ on some set $S \subset \mathbb{R}$, the differential
\[
df
\]
is a differential form.
The product of a differential form $\Phi$ with a function $g$ is defined as the
function whose value at $p$ on $h$ is
\[
(g\Phi)|_p(h) = g(p)\,\Phi|_p(h). \tag{30.4}
\]
Sometimes we write $g\Phi$ as $\Phi g$; thus
\[
\Phi g = g\Phi. \tag{30.5}
\]
Working with a differentiable function $f$ we have
\[
(df(x))|_p(h) = f'(p)h
\]
and this is exactly the same as
\[
(f'(x)\,dx)|_p(h) = f'(p)h.
\]
Hence we have the extremely useful notational identity:
\[
df(x) = f'(x)\,dx. \tag{30.6}
\]
All of this notation has been designed to produce this notational consistency:
\[
\frac{df(x)}{dx} = f'(x), \tag{30.7}
\]
where on the left we now have a genuine ratio (of functions), not just a formal
one.

Using equation (30.6) we can easily verify the following convenient
identities:
\[
\begin{aligned}
d(f+g) &= df + dg \\
dC &= 0 \quad\text{if $C$ is constant} \\
d(fg) &= (df)g + f\,dg \\
d\left(\frac{f}{g}\right) &= \frac{g\,df - f\,dg}{g^2} \\
df(g(x)) &= f'(g(x))\,dg(x) \quad\text{(this is from the chain rule),}
\end{aligned} \tag{30.8}
\]
where $f$ and $g$ are differentiable functions on some common domain except
that in the last identity we assume the composite $f(g(x))$ is defined on some
open interval. (Note $\Phi/f$ means $\frac{1}{f}\Phi$, for any differential form $\Phi$ and function
$f$.)


As an example, we have
\[
d\log(\sin x^2) = \frac{1}{\sin x^2}\,\cos(x^2)\cdot 2x\,dx.
\]

If $f$ is a differentiable function on an interval containing points $a$ and $b$
we define the integral of the differential $df$ to be
\[
\int_a^b df = f(b) - f(a). \tag{30.9}
\]
For example,
\[
\int_{\pi/2}^{\pi} d(\sin x) = \sin\pi - \sin(\pi/2) = 0 - 1 = -1,
\]
and
\[
\int_1^0 de^x = e^0 - e^1 = 1 - e,
\]
where note that the upper endpoint is actually less than the lower endpoint.

All of what we have done in this section is essentially just notation. Now
there is a nice convergence of notation with the Riemann integral: for any
differentiable function $g$ on any interval $[a,b]$, with $a,b \in \mathbb{R}$ and $a < b$, we
have
\[
\int_a^b g' = g(b) - g(a) = \int_a^b dg = \int_a^b g'(x)\,dx. \tag{30.10}
\]
Hence
\[
\int_a^b g'(x)\,dx = \int_a^b g'. \tag{30.11}
\]
If $f$ is any continuous function on $[a,b]$ then by the fundamental theorem of
calculus there is a function $F$ on $[a,b]$ for which
\[
F' = f.
\]
Thus,
\[
\int_a^b f = \int_a^b F' = \int_a^b F'(x)\,dx = \int_a^b f(x)\,dx.
\]
Thus we have, finally, a complete identification of the Riemann integral as
the integral of a differential form:
\[
\int_a^b f = \int_a^b f(x)\,dx. \tag{30.12}
\]
The Riemann integral is rooted in ideas going back to Archimedes' computation
of areas. The notation of differentials was invented in the 1600s by
Leibniz, but a precise development of differential forms (in higher dimensions)
was done only in the early 20th century by Élie Cartan. The equality
(30.12) is, for us, a theorem (an easy consequence of the fundamental theorem
of calculus, as we have seen) but traditionally is viewed simply
as different notation for the same integral.


30.3. Using the fundamental theorem 


Let us start with a simple example. We will work out the integral
\[
\int_0^1 x\,dx.
\]
By Theorem 30.1.2, if we can find a function $g$ for which $g'(x) = x$ on $[0,1]$
then we can easily determine the value of the integral $\int_0^1 x\,dx$ as $g(1) - g(0)$.
We can easily find a function $g$ for which
\[
g'(x) = x
\]
for all $x \in \mathbb{R}$. Just recall that
\[
\frac{d}{dx}(x^2) = 2x,
\]
and so
\[
\frac{d}{dx}\left(\frac{x^2}{2}\right) = \frac{2x}{2} = x.
\]
Then by Theorem 30.1.2 we have
\[
\int_0^1 x\,dx = \int_0^1 \left(\frac{x^2}{2}\right)' dx = \frac{x^2}{2}\Big|_0^1 = \frac{1^2}{2} - \frac{0^2}{2} = \frac{1}{2}.
\]
Geometrically, this is the area under the graph of
\[
y = x
\]
over $x \in [0,1]$. This is just a right angled triangle with base 1 and height 1,
and so its area is indeed $(1/2)\cdot 1\cdot 1 = 1/2$.
Next consider the area under the parabola
\[
y = x^2
\]
over $x \in [a,b]$, where $a,b \in \mathbb{R}$ with $a < b$. The area is
\[
\int_a^b x^2\,dx.
\]
We need to find a function whose derivative is $x^2$. Recall that
\[
(x^3)' = 3x^2
\]
and so
\[
\left(\frac{x^3}{3}\right)' = x^2.
\]
Therefore
\[
\int_a^b x^2\,dx = \int_a^b \left(\frac{x^3}{3}\right)' dx \overset{\text{Thm 30.1.2}}{=} \frac{x^3}{3}\Big|_a^b = \frac{b^3}{3} - \frac{a^3}{3}.
\]

[Figure 30.2: Area below $y = x^2$ for $x \in [a,b]$]

Archimedes' amazing determination of areas associated with parabolas has
thus been reduced to a simple routine calculation.
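
As a cross-check on the formula $b^3/3 - a^3/3$, one can compare it with a Riemann sum. The sketch below uses midpoints of equal subintervals as the sample points; the midpoint choice, the interval $[1,2]$, the number of subintervals, and the name `riemann_sum` are all assumptions made for this illustration.

```python
def riemann_sum(f, a, b, n):
    """Riemann sum of f over [a, b] with n equal subintervals, sampled at midpoints."""
    width = (b - a) / n
    return sum(f(a + (k + 0.5) * width) for k in range(n)) * width

a, b = 1.0, 2.0
approx = riemann_sum(lambda x: x * x, a, b, 1000)
exact = b**3 / 3 - a**3 / 3   # value given by the fundamental theorem of calculus

print("Riemann sum:", approx)
print("b^3/3 - a^3/3:", exact)
```

With a thousand subintervals the Riemann sum already agrees with the exact value to about six decimal places.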


Next consider the area under
\[
y = \sin x, \quad x \in [0,\pi].
\]

[Figure 30.3: Area below $y = \sin x$ for $x \in [0,\pi]$; $\int_0^\pi \sin x\,dx = 2$]

The area is
\[
\int_0^\pi \sin x\,dx = \int_0^\pi (-\cos x)'\,dx = -\cos x\Big|_0^\pi = -\cos\pi + \cos 0 = 2. \tag{30.13}
\]


30.4 Indefinite integrals


With the examples of the previous section, it is clear that the task of finding
a function $g$ whose derivative $g'$ is a given function $f$ is crucial to computing
integrals exactly. The indefinite integral of a function $f$ is the general function
$g$ for which
\[
g' = f.
\]
The indefinite integral is denoted
\[
\int f(x)\,dx,
\]
or with $t$ or some other ‘dummy variable’ in place of $x$.
For example, we know that
\[
\left(\frac{x^3}{3}\right)' = x^2.
\]
If $g(x)$ is any function whose derivative is also $x^2$ then
\[
\left(g(x) - \frac{x^3}{3}\right)' = x^2 - x^2 = 0,
\]
and, assuming we are working over $\mathbb{R}$ (or any interval), it follows that
\[
g(x) - \frac{x^3}{3} = \text{constant}.
\]
Denoting this ‘arbitrary constant’ by $C$ we have
\[
g(x) = \frac{x^3}{3} + C.
\]
Thus we have the indefinite integral of $x^2$:
\[
\int x^2\,dx = \frac{x^3}{3} + C,
\]
where $C$ is an arbitrary constant. The presence of this arbitrary constant
ensures that we have the ‘general’ solution to the problem of finding a function
whose derivative is $x^2$, not just one special choice. Of course, since the
difference between two such solutions is just a constant, it is not a matter of
great importance.
In general, in mathematics it is not good practice to use ambiguous
notation, but writing
\[
\int f(x)\,dx
\]
for not one function but for the class of all functions with derivative $f(x)$ is
worth the occasional discomfort caused by the ambiguity.
If a differential form $\Phi$ is expressed as
\[
\Phi = f(x)\,dx
\]
then we have the integral of $\Phi$:
\[
\int \Phi = \int f(x)\,dx.
\]
Thus an integral of a differential form $\Phi$ is a function $g$ for which
\[
dg = \Phi.
\]
Such integrals are of great interest and use in higher dimensions.
For any integer $n$, except for $n = -1$,
\[
\int x^n\,dx = \frac{x^{n+1}}{n+1} + C, \tag{30.14}
\]
with $C$ being an arbitrary constant, as you can readily check by differentiating
$x^{n+1}/(n+1)$:
\[
\left(\frac{x^{n+1}}{n+1} + C\right)' = \frac{(n+1)x^n}{n+1} = x^n.
\]
What if $n = -1$? Then we are seeking a function $g(x)$ whose derivative is
$1/x$:
\[
g'(x) = \frac{1}{x}.
\]
Recall that
\[
\log' t = \frac{1}{t} \quad\text{for } t \in (0,\infty).
\]
Thus we can write
\[
\int \frac{1}{x}\,dx = \log x + C \tag{30.15}
\]
for any arbitrary constant $C$, provided we restrict the functions all to the
positive ray $(0,\infty)$.

Is there a function defined for negative values of $x$ with derivative $1/x$?
It is easily checked that
\[
\frac{d\log(-x)}{dx} = [\log'(-x)]\cdot(-1) = \frac{1}{-x}\cdot(-1) = \frac{1}{x} \quad\text{for } x \in (-\infty,0).
\]
Thus sometimes it is convenient to use the combined formula
\[
\log|x| + C
\]


for the indefinite integral of 1/x. However, logx and log(—x) do not really 
splice together naturally, and not simply because of the necessary gap at 
x = 0. A full exploration of this would require going into the complex plane, 
a subject well beyond our objectives. 


30.5 Revisiting the exponential function 


Historically the logarithm log was invented before the exponential function.
However, mathematically, it seems more natural to develop the exponential
function $e^x$ first and then define log as the inverse function. This is essentially
what we have done, except that we never really defined $e^x$ in a logically
connected way. In Section 26.5 we developed the properties of $e^x$ by assuming
that there is a function exp on $\mathbb{R}$ with the following two properties:
\[
\exp' = \exp, \qquad \exp(0) = 1. \tag{30.16}
\]
Using this assumption we proved that
\[
\exp(x) = e^x \quad\text{for all } x \in \mathbb{R},
\]
where
\[
e = \exp(1).
\]
The function exp has a strictly positive derivative, which implies that it has
an inverse function. We defined log to be the inverse function of exp, defining
$\log a$ to be the unique real number for which
\[
e^{\log a} = a,
\]
with $\log a$ being defined for all $a \in (0,\infty)$. Then we showed that
\[
\log' x = \frac{1}{x} \quad\text{for all } x \in (0,\infty).
\]


Having developed integration theory we are now at a point where we can
turn this logical development on its head (thereby regaining the correct, if
strange, historical development) by defining the function log directly as
\[
\log a = \int_1^a \frac{1}{x}\,dx \quad\text{for all } a \in (0,\infty). \tag{30.17}
\]
Since $1/x$ is continuous on $(0,\infty)$, the fundamental theorem of calculus
guarantees that the integral defining $\log a$ exists and is finite; moreover it
also assures us that
\[
\log' a = \frac{1}{a} \quad\text{for all } a \in (0,\infty). \tag{30.18}
\]
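
The definition (30.17) and the derivative formula (30.18) can both be explored numerically. The sketch below approximates $\int_1^a dx/x$ by a midpoint Riemann sum and compares it with the library logarithm, then estimates the derivative of the resulting function by a difference quotient. The function names, the number of subintervals, and the step `h` are assumptions made for this illustration; for $a < 1$ the width is negative, which matches the convention of an integral whose upper endpoint is below the lower one.

```python
import math

def log_via_integral(a, n=100_000):
    """Approximate the integral of 1/x from 1 to a by a midpoint Riemann sum."""
    width = (a - 1.0) / n   # negative when a < 1
    return sum(1.0 / (1.0 + (k + 0.5) * width) for k in range(n)) * width

def derivative_estimate(a, h=1e-4):
    """Symmetric difference quotient of log_via_integral at a."""
    return (log_via_integral(a + h) - log_via_integral(a - h)) / (2.0 * h)

for a in (0.5, 1.0, 2.0, 10.0):
    print(a, log_via_integral(a), math.log(a))

print("derivative at 2:", derivative_estimate(2.0), " expected 1/2")
```

The integral values track `math.log`, and the difference quotient at $a = 2$ is close to $1/2$, in line with (30.18).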
This derivative being strictly positive, log has an inverse function, call it exp.
Its derivative is (by Proposition 24.1.1):
\[
\exp'(y) = \frac{1}{\log' x},
\]
where $y = \log x$, and so
\[
\exp'(y) = \frac{1}{1/x} = x = \exp(y).
\]


Moreover, the definition of exp as inverse of log along with the simple fact 
that 

log(1) =0 
shows that 

exp(0) = 1. 


Thus exp is indeed a function satisfying both conditions in (30.16), making
the development of both exp and log logically complete.

Chapter 31


Riemann Sum Examples 


In this chapter we work out some Riemann sums and compare them with the
corresponding integrals.

31.1 Riemann sums for $\int \frac{dx}{x^2}$

Let $N$ be an integer $> 1$. Consider the partition of $[1,N]$ given by
\[
P = \{1, 2, \ldots, N\}.
\]
This breaks up $[1,N]$ into $N - 1$ intervals, each of width 1:
\[
[1,2],\ [2,3],\ \ldots,\ [N-1,N].
\]
The $k$-th interval has the form
\[
[k,k+1].
\]
We will work out the upper and lower Riemann sums for $1/x^2$ relative to
$P$. On the interval
\[
[k,k+1]
\]
the highest value of $1/x^2$ is $1/k^2$ and the lowest value is $1/(k+1)^2$. The area
of the $k$-th lower rectangle is therefore
\[
\text{area of $k$-th lower rectangle} = \frac{1}{(k+1)^2}\cdot 1,
\]
because its width is 1. Similarly
\[
\text{area of $k$-th upper rectangle} = \frac{1}{k^2}\cdot 1.
\]


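Hence the lower sum is
\[
L(1/x^2, P) = \sum_{k=1}^{N-1} \frac{1}{(k+1)^2}\cdot 1 = \frac{1}{2^2} + \frac{1}{3^2} + \cdots + \frac{1}{N^2}.
\]
Similarly, the upper sum is
\[
U(1/x^2, P) = \sum_{k=1}^{N-1} \frac{1}{k^2}\cdot 1 = \frac{1}{1^2} + \frac{1}{2^2} + \cdots + \frac{1}{(N-1)^2}.
\]
The actual integral
\[
\int_1^N \frac{dx}{x^2}
\]
lies between these values:
\[
L(1/x^2, P) \le \int_1^N \frac{dx}{x^2} \le U(1/x^2, P). \tag{31.1}
\]
The upper and lower sums are not really very good approximations to the
value of the integral because they are separated by quite a bit:
\[
U(1/x^2, P) - L(1/x^2, P) = \frac{1}{1^2} - \frac{1}{N^2} = 1 - \frac{1}{N^2},
\]
which tends to 1 as $N \to \infty$.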
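We can work out the integral of $1/x^2$ using the fundamental theorem of
calculus:
\[
\int_1^N \frac{dx}{x^2} = \int_1^N \left(-\frac{1}{x}\right)' dx = -\frac{1}{x}\Big|_1^N = -\frac{1}{N} - (-1) = 1 - \frac{1}{N}.
\]
Using this in (31.1) produces:
\[
\frac{1}{2^2} + \frac{1}{3^2} + \cdots + \frac{1}{N^2} \le 1 - \frac{1}{N} \le \frac{1}{1^2} + \frac{1}{2^2} + \cdots + \frac{1}{(N-1)^2}. \tag{31.2}
\]
We can extract some information from this by focusing on the first inequality:
\[
s_N = \frac{1}{2^2} + \frac{1}{3^2} + \cdots + \frac{1}{N^2} \le 1 - \frac{1}{N}. \tag{31.3}
\]
This is true for all integers $N \in \{2,3,\ldots\}$. Observe that the sequence of
sums $s_2, s_3, \ldots$ increases in value as additional terms are added on:
\[
s_2 < s_3 < s_4 < \cdots
\]
Therefore, there is a limit
\[
\lim_{N\to\infty} s_N = \sup_{N\in\{2,3,\ldots\}} s_N.
\]
From (31.3) we see that
\[
\lim_{N\to\infty} s_N \le 1.
\]
Thus,
\[
\lim_{N\to\infty}\left[\frac{1}{2^2} + \frac{1}{3^2} + \cdots + \frac{1}{N^2}\right] \le 1.
\]
This limit is displayed as an ‘infinite series sum’:
\[
\sum_{n=2}^{\infty} \frac{1}{n^2} = \frac{1}{2^2} + \frac{1}{3^2} + \cdots
\]
Since the value of this is finite (being $\le 1$) the value of
\[
\frac{1}{1^2} + \frac{1}{2^2} + \frac{1}{3^2} + \cdots
\]
is also finite, having value $\le 2$. One says that the series
\[
\sum_{n=1}^{\infty} \frac{1}{n^2} = \frac{1}{1^2} + \frac{1}{2^2} + \frac{1}{3^2} + \cdots \tag{31.4}
\]
converges.
The convergence of the series above can be seen in other ways, but the
method using the integral $\int 1/x^2\,dx$ is useful for other similar sums as well.
The actual value of the sum can be computed by more advanced
methods; the amazing result is
\[
\sum_{n=1}^{\infty} \frac{1}{n^2} = \frac{\pi^2}{6}. \tag{31.5}
\]
This identity was established by Euler in 1735.
Observe that
\[
\lim_{t\to\infty}\int_1^t \frac{dx}{x^2} = \lim_{t\to\infty}\left[1 - \frac{1}{t}\right] = 1.
\]
We can interpret this to mean that the area under the graph
\[
y = \frac{1}{x^2}
\]
all the way over $[1,\infty)$ is 1:
\[
\int_1^\infty \frac{dx}{x^2} = 1, \tag{31.6}
\]
where we are taking the integral $\int_1^\infty$ on the left side to mean the limit of $\int_1^t$
as $t \to \infty$.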
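
The inequalities in (31.1) and the value in (31.5) lend themselves to a quick numerical check. The sketch below computes the lower and upper sums of $1/x^2$ over the partition $\{1,2,\ldots,N\}$ for a few values of $N$, and compares a long partial sum with $\pi^2/6$. The function names and the particular values of $N$ are choices made for illustration.

```python
import math

def lower_sum(N):
    """L(1/x^2, P) for P = {1, ..., N}: 1/2^2 + 1/3^2 + ... + 1/N^2."""
    return sum(1.0 / (k * k) for k in range(2, N + 1))

def upper_sum(N):
    """U(1/x^2, P) for P = {1, ..., N}: 1/1^2 + 1/2^2 + ... + 1/(N-1)^2."""
    return sum(1.0 / (k * k) for k in range(1, N))

def exact_integral(N):
    """Integral of 1/x^2 over [1, N], from the fundamental theorem: 1 - 1/N."""
    return 1.0 - 1.0 / N

for N in (2, 10, 100, 1000):
    print(N, lower_sum(N), exact_integral(N), upper_sum(N))

# Partial sum of the full series 1/1^2 + 1/2^2 + ... compared with pi^2/6.
basel_estimate = 1.0 + lower_sum(100_000)
print(basel_estimate, math.pi ** 2 / 6)
```

In each printed row the integral sits between the lower and upper sums, and the long partial sum is close to, and slightly below, $\pi^2/6$.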


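31.2 Riemann sums for 1/x


For the partition

\[ P = \{1, 2, \ldots, N\} \]

of $[1, N]$, where $N$ is any integer $> 1$, and the function $1/x$ we have the lower
sum

\[ L(1/x, P) = \frac{1}{2} \cdot 1 + \cdots + \frac{1}{N} \cdot 1 \]

and the upper sum

\[ U(1/x, P) = 1 \cdot 1 + \frac{1}{2} \cdot 1 + \cdots + \frac{1}{N-1} \cdot 1. \]

The exact integral, which lies between these sums, is

\[ \int_1^N \frac{dx}{x} = \log(x) \Big|_1^N = \log(N) - \log(1) = \log(N). \]

Hence

\[ \frac{1}{2} + \frac{1}{3} + \cdots + \frac{1}{N} < \log N < 1 + \frac{1}{2} + \cdots + \frac{1}{N-1}. \tag{31.7} \]

Unlike what happened with $1/x^2$, we have an infinite area under the graph
of $1/x$ over $[1, \infty)$:

\[ \int_1^{\infty} \frac{dx}{x} = \lim_{t\to\infty} \int_1^t \frac{dx}{x} = \lim_{t\to\infty} \log(t) = \infty. \tag{31.8} \]

Looking back to the second inequality in (31.7) we have

\[ 1 + \frac{1}{2} + \cdots + \frac{1}{N-1} > \log N \to \infty \quad \text{as } N \to \infty, \]

and so

\[ \sum_{n=1}^{\infty} \frac{1}{n} = 1 + \frac{1}{2} + \frac{1}{3} + \cdots = \infty. \tag{31.9} \]

The series $\sum_{n=1}^{\infty} \frac{1}{n}$ is called the harmonic series and the fact that the sum is
$\infty$ is expressed by saying that the series is divergent.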
The difference between the upper sum and the integral over $[1, N]$ is

\[ 1 + \frac{1}{2} + \cdots + \frac{1}{N-1} - \log(N), \tag{31.10} \]

and its limit as $N \to \infty$ is called Euler’s constant.
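
To see the inequalities (31.7) and the slow growth of the harmonic sums in numbers, here is a small sketch. It is an illustration only; the helper names `harmonic` and `euler_gap` are not from the text, and the reference value 0.5772156649 for Euler's constant is the standard decimal approximation.

```python
import math

def harmonic(M):
    """Return 1 + 1/2 + ... + 1/M."""
    return sum(1.0 / n for n in range(1, M + 1))

def euler_gap(N):
    """Upper sum minus integral over [1, N]: 1 + 1/2 + ... + 1/(N-1) - log(N)."""
    return harmonic(N - 1) - math.log(N)

for N in (10, 100, 10000):
    lower = harmonic(N) - 1.0      # 1/2 + ... + 1/N
    upper = harmonic(N - 1)        # 1 + 1/2 + ... + 1/(N-1)
    # Columns: N, lower sum, log N, upper sum, gap between upper sum and log N.
    print(N, lower, math.log(N), upper, euler_gap(N))
```

The sums grow without bound, but only about as fast as $\log N$, while the gap in the last column settles down near Euler's constant.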


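31.3 Riemann sums for x


We focus on the function given by $f(x) = x$ on $[0, 1]$.
Let $N$ be an integer $> 1$. Consider the partition of $[0, 1]$ given by

\[ P = \left\{ 0, \frac{1}{N}, \frac{2}{N}, \ldots, \frac{N}{N} \right\}. \]

This breaks up $[0, 1]$ into $N$ intervals, each of width $\frac{1}{N}$:

\[ \left[ 0, \frac{1}{N} \right], \left[ \frac{1}{N}, \frac{2}{N} \right], \ldots, \left[ \frac{N-1}{N}, \frac{N}{N} \right]. \]

The $k$-th interval has the form

\[ \left[ \frac{k-1}{N}, \frac{k}{N} \right]. \]

We will work out the upper and lower Riemann sums for the function $x$
relative to $P$. On the interval

\[ \left[ \frac{k-1}{N}, \frac{k}{N} \right] \]

the sup of $x$ is just $\frac{k}{N}$ and the inf is $\frac{k-1}{N}$.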
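The area of the $k$-th lower rectangle is therefore

\[ \text{area of $k$-th lower rectangle} = \frac{k-1}{N} \cdot \frac{1}{N} \]

because its width is $\frac{1}{N}$. Similarly

\[ \text{area of $k$-th upper rectangle} = \frac{k}{N} \cdot \frac{1}{N}. \]

Hence the lower sum is

\[ L(x, P) = \sum_{k=1}^{N} \frac{k-1}{N} \cdot \frac{1}{N}
 = \frac{1}{N} \left[ \frac{0}{N} + \frac{1}{N} + \cdots + \frac{N-1}{N} \right]
 = \frac{1}{N^2} \left[ 1 + 2 + \cdots + (N-1) \right]. \tag{31.11} \]

Similarly, the upper sum is

\[ U(x, P) = \sum_{k=1}^{N} \frac{k}{N} \cdot \frac{1}{N}
 = \frac{1}{N} \left[ \frac{1}{N} + \frac{2}{N} + \cdots + \frac{N}{N} \right]
 = \frac{1}{N^2} \left[ 1 + 2 + \cdots + N \right]. \tag{31.12} \]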


The integral

\[ \int_0^1 x\,dx \]

lies between these values:

\[ L(x, P) \le \int_0^1 x\,dx \le U(x, P). \tag{31.13} \]

Observe that $U(x, P)$ differs from $L(x, P)$ by the term $\frac{1}{N^2} \cdot N$:

\[ U(x, P) - L(x, P) = \frac{1}{N} \to 0 \quad \text{as } N \to \infty. \]

This implies that there can be only one number lying between the upper
sums and the lower sums and hence that the integral

\[ \int_0^1 x\,dx \]

exists and

\[ \int_0^1 x\,dx = \lim_{N\to\infty} \frac{1}{N^2} \left[ 1 + 2 + \cdots + N \right]. \]


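Now we use the sum formula (see (31.24) below):

\[ 1 + 2 + \cdots + N = \frac{N(N+1)}{2}. \tag{31.14} \]

Hence

\[ \int_0^1 x\,dx = \lim_{N\to\infty} \frac{1}{N^2} \cdot \frac{N(N+1)}{2}
 = \lim_{N\to\infty} \frac{1}{N^2} \cdot \frac{N \cdot N(1 + 1/N)}{2}
 = \lim_{N\to\infty} \frac{1 + 1/N}{2} = \frac{1}{2}. \]

Hence

\[ \int_0^1 x\,dx = \frac{1}{2}. \tag{31.15} \]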
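
The lower and upper sums above can be computed exactly with rational arithmetic. The sketch below is an illustration under the assumption that the function is increasing on $[0, 1]$, so that its inf and sup on each interval sit at the left and right endpoints; the name `lower_upper_sums` is hypothetical.

```python
from fractions import Fraction

def lower_upper_sums(f, N):
    """Lower and upper sums of an increasing function f on [0, 1],
    for the partition {0, 1/N, 2/N, ..., N/N}."""
    width = Fraction(1, N)
    # inf on the k-th interval is f((k-1)/N), sup is f(k/N) for increasing f.
    lower = sum(f(Fraction(k - 1, N)) * width for k in range(1, N + 1))
    upper = sum(f(Fraction(k, N)) * width for k in range(1, N + 1))
    return lower, upper

for N in (2, 10, 100):
    L, U = lower_upper_sums(lambda x: x, N)
    # Columns: N, L(x, P), U(x, P), and the difference U - L, which is 1/N.
    print(N, L, U, U - L)
```

Because the arithmetic is exact, the difference $U - L$ comes out as exactly $1/N$, and $1/2$ is trapped between the two sums for every $N$.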


This result is, of course, far easier to see using the fundamental theorem of
calculus:

\[ \int_0^1 x\,dx = \int_0^1 \left( \frac{1}{2} x^2 \right)' dx = \frac{1}{2} x^2 \Big|_0^1 = \frac{1}{2}. \tag{31.16} \]


31.4 Riemann sums for $x^2$


We focus on the function given by $f(x) = x^2$ on $[0, 1]$.
Let $N$ be an integer $> 1$. We work with the partition of $[0, 1]$ given by

\[ P = \left\{ 0, \frac{1}{N}, \frac{2}{N}, \ldots, \frac{N}{N} \right\}. \]

The $k$-th interval marked off by this partition is

\[ \left[ \frac{k-1}{N}, \frac{k}{N} \right]. \]

On the interval

\[ \left[ \frac{k-1}{N}, \frac{k}{N} \right] \]

the sup of $x^2$ is just $\frac{k^2}{N^2}$ and the inf is $\frac{(k-1)^2}{N^2}$.
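The area of the $k$-th lower rectangle is therefore

\[ \text{area of $k$-th lower rectangle} = \frac{(k-1)^2}{N^2} \cdot \frac{1}{N} \]

because its width is $\frac{1}{N}$. Similarly

\[ \text{area of $k$-th upper rectangle} = \frac{k^2}{N^2} \cdot \frac{1}{N}. \]

Hence the lower sum is

\[ L(x^2, P) = \sum_{k=1}^{N} \frac{(k-1)^2}{N^2} \cdot \frac{1}{N}
 = \frac{1}{N} \left[ \frac{0^2}{N^2} + \frac{1^2}{N^2} + \frac{2^2}{N^2} + \cdots + \frac{(N-1)^2}{N^2} \right]
 = \frac{1}{N^3} \left[ 1^2 + 2^2 + \cdots + (N-1)^2 \right]. \tag{31.17} \]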
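Similarly, the upper sum is

\[ U(x^2, P) = \sum_{k=1}^{N} \frac{k^2}{N^2} \cdot \frac{1}{N}
 = \frac{1}{N} \left[ \frac{1^2}{N^2} + \frac{2^2}{N^2} + \cdots + \frac{N^2}{N^2} \right]
 = \frac{1}{N^3} \left[ 1^2 + 2^2 + \cdots + N^2 \right]. \tag{31.18} \]

The integral

\[ \int_0^1 x^2\,dx \]

lies between these values:

\[ L(x^2, P) \le \int_0^1 x^2\,dx \le U(x^2, P). \tag{31.19} \]

The upper sum $U(x^2, P)$ differs from the lower sum $L(x^2, P)$ by $\frac{1}{N^3} \cdot N^2$:

\[ U(x^2, P) - L(x^2, P) = \frac{1}{N} \to 0 \quad \text{as } N \to \infty. \]

Hence there can be only one number lying between the upper sums and the
lower sums and so the integral

\[ \int_0^1 x^2\,dx \]

exists and

\[ \int_0^1 x^2\,dx = \lim_{N\to\infty} \frac{1}{N^3} \left[ 1^2 + 2^2 + \cdots + N^2 \right]. \]

We now need the sum formula (see (31.31) below):

\[ 1^2 + 2^2 + \cdots + N^2 = \frac{N(N+1)(2N+1)}{6}. \tag{31.20} \]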
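Hence

\[ \int_0^1 x^2\,dx = \lim_{N\to\infty} \frac{1}{N^3} \cdot \frac{N(N+1)(2N+1)}{6}
 = \lim_{N\to\infty} \frac{1}{N^3} \cdot \frac{N \cdot N(1 + 1/N) \cdot N(2 + 1/N)}{6}
 = \lim_{N\to\infty} \frac{(1 + 1/N)(2 + 1/N)}{6} = \frac{2}{6}. \tag{31.21} \]

Hence

\[ \int_0^1 x^2\,dx = \frac{1}{3}. \tag{31.22} \]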
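
As a check on (31.17) to (31.22), the sums for $x^2$ can be evaluated exactly and compared with the closed form obtained from the sum formula. This is a sketch for illustration; the function name `riemann_sums_x_squared` is not from the text.

```python
from fractions import Fraction

def riemann_sums_x_squared(N):
    """Exact lower and upper sums of x^2 on [0, 1] with N equal intervals."""
    width = Fraction(1, N)
    lower = sum(Fraction(k - 1, N) ** 2 * width for k in range(1, N + 1))
    upper = sum(Fraction(k, N) ** 2 * width for k in range(1, N + 1))
    return lower, upper

for N in (5, 50, 500):
    L, U = riemann_sums_x_squared(N)
    # Closed form of the upper sum: N(N+1)(2N+1) / (6 N^3).
    closed_form = Fraction(N * (N + 1) * (2 * N + 1), 6 * N ** 3)
    print(N, float(L), float(U), U == closed_form)
```

Both columns of sums squeeze in on $1/3$ as $N$ grows, from below and from above respectively.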
This, of course, agrees with the result using the fundamental theorem of
calculus:

\[ \int_0^1 x^2\,dx = \int_0^1 \left( \frac{1}{3} x^3 \right)' dx = \frac{1}{3} x^3 \Big|_0^1 = \frac{1}{3}. \tag{31.23} \]


31.5 Power sums


We will work out a few power sum formulas, starting with

\[ 1 + 2 + \cdots + N = \frac{N(N+1)}{2}. \tag{31.24} \]


There are several ways of proving this formula. Let

\[ S_1 = 1 + 2 + \cdots + N. \]

Writing the same sum backwards we have:

\[ \begin{aligned}
S_1 &= 1 + 2 + \cdots + (N-1) + N \\
S_1 &= N + (N-1) + \cdots + 2 + 1.
\end{aligned} \tag{31.25} \]

In the first equation above the terms on the right increase by 1 at each step
whereas in the second equation the terms decrease by 1 at each stage. Adding
them we have

\[ 2S_1 = (N+1) + (N+1) + \cdots + (N+1) = N(N+1). \]

Hence

\[ S_1 = \frac{N(N+1)}{2}. \]

Here is a second, longer and far less appealing method. We start with

\[ (a+1)^2 = a^2 + 2a + 1 \]

which implies

\[ (a+1)^2 - a^2 = 2a + 1. \]

Now use $a = 1$ through $a = N$ in this to obtain:

\[ \begin{aligned}
2^2 - 1^2 &= 2 \cdot 1 + 1 \\
3^2 - 2^2 &= 2 \cdot 2 + 1 \\
&\ \,\vdots \\
(N+1)^2 - N^2 &= 2 \cdot N + 1.
\end{aligned} \tag{31.26} \]

Adding all these up we see that on the left all terms cancel except for the
very first one $-1^2$ and the very last one $(N+1)^2$:

\[ (N+1)^2 - 1^2 = 2 \underbrace{[1 + 2 + \cdots + N]}_{S_1} + N \cdot 1. \]

Hence

\[ 2S_1 = (N+1)^2 - 1 - N = (N+1)^2 - (N+1) = (N+1-1)(N+1) = N(N+1), \]

which implies

\[ S_1 = \frac{N(N+1)}{2}. \tag{31.27} \]


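The advantage of this method is that it works for sums of higher powers. We
use this method to find the sum

\[ S_2 = 1^2 + 2^2 + \cdots + N^2. \tag{31.28} \]

We start now with

\[ (a+1)^3 = a^3 + 3a^2 + 3a + 1 \]

from which we have

\[ (a+1)^3 - a^3 = 3a^2 + 3a + 1. \]

Using $a = 1$ through $a = N$ in this produces

\[ \begin{aligned}
2^3 - 1^3 &= 3 \cdot 1^2 + 3 \cdot 1 + 1 \\
3^3 - 2^3 &= 3 \cdot 2^2 + 3 \cdot 2 + 1 \\
&\ \,\vdots \\
(N+1)^3 - N^3 &= 3 \cdot N^2 + 3 \cdot N + 1.
\end{aligned} \tag{31.29} \]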


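When we add these up on the left everything cancels except the very first
term $-1^3$ and the very last term $(N+1)^3$, and we have:

\[ (N+1)^3 - 1^3 = 3 \underbrace{[1^2 + 2^2 + \cdots + N^2]}_{S_2} + 3 \underbrace{[1 + 2 + \cdots + N]}_{S_1} + N \cdot 1. \]

Hence

\[ 3S_2 = (N+1)^3 - 1 - 3S_1 - N. \]

Using the formula (31.27) we have

\[ \begin{aligned}
3S_2 &= (N+1)^3 - 1 - N - 3S_1 \\
 &= (N+1)^3 - (N+1) - 3\,\frac{N(N+1)}{2} \\
 &= (N+1) \left[ (N+1)^2 - 1 - \frac{3N}{2} \right] = (N+1) \left[ N^2 + 2N + 1 - 1 - \frac{3N}{2} \right] \\
 &= (N+1) \left[ N^2 + \frac{N}{2} \right] = (N+1) N \left[ N + \frac{1}{2} \right] \\
 &= \frac{N(N+1)(2N+1)}{2}.
\end{aligned} \tag{31.30} \]

Hence

\[ S_2 = \frac{N(N+1)(2N+1)}{6}. \tag{31.31} \]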


Applied to the sum of the cubes this strategy produces

\[ 1^3 + 2^3 + \cdots + N^3 = \left( \frac{N(N+1)}{2} \right)^2. \tag{31.32} \]

Amazingly, this is just the square of the sum $S_1$:

\[ 1^3 + 2^3 + \cdots + N^3 = (1 + 2 + \cdots + N)^2. \tag{31.33} \]
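
The three power sum formulas, and the telescoping step used to reach (31.31), can be verified by brute force for small $N$. The sketch below is illustrative only; `power_sum` and `telescoped_S2` are names introduced here.

```python
def power_sum(N, p):
    """Return 1^p + 2^p + ... + N^p by direct addition."""
    return sum(k ** p for k in range(1, N + 1))

def telescoped_S2(N):
    """Solve (N+1)^3 - 1 = 3*S_2 + 3*S_1 + N for S_2, as in the text."""
    S1 = N * (N + 1) // 2
    return ((N + 1) ** 3 - 1 - 3 * S1 - N) // 3

for N in (1, 2, 3, 10, 100):
    S1, S2, S3 = power_sum(N, 1), power_sum(N, 2), power_sum(N, 3)
    print(N,
          S1 == N * (N + 1) // 2,                  # (31.24)
          S2 == N * (N + 1) * (2 * N + 1) // 6,    # (31.31)
          S2 == telescoped_S2(N),                  # telescoping argument
          S3 == S1 ** 2)                           # (31.33)
```

Integer arithmetic is used throughout, so each comparison is exact rather than approximate.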


Chapter 32 


Integration Techniques 


In this chapter we study some techniques for working out indefinite integrals.
What this means is that we are given a function $f(x)$ and we have to find a
function $g(x)$ for which

\[ dg(x) = f(x)\,dx. \]

Since $dg(x)$ is $g'(x)\,dx$ this means finding a function $g$ whose derivative is $f$:

\[ g'(x) = f(x). \]

We write the general such function as the indefinite integral

\[ g(x) = \int f(x)\,dx. \]

Here $f(x)$ is called the integrand.


32.1 Substitutions


For the integral

\[ \int (x+1)^5\,dx \]

it would be a long and slow method to work out the fifth power and then
integrate. Instead we substitute $y$ for $x + 1$ and rewrite the integral in terms
of $y$:

\[ y = x + 1. \]

Then

\[ dy = dx \]

and so

\[ \int (x+1)^5\,dx = \int y^5\,dy = \frac{1}{6} y^6 + C = \frac{1}{6} (x+1)^6 + C, \]

where $C$ is an arbitrary constant.


The essence of the idea behind the substitution method is simple. We
inspect the integral

\[ \int f(x)\,dx \]

and write it in the form

\[ \int F(p(x))\,p'(x)\,dx, \]

for some functions $F$ and $p$, and then substitute

\[ y = p(x) \]

to transform the given integral as

\[ \int f(x)\,dx = \int F(p(x))\,p'(x)\,dx = \int F(p(x))\,dp(x) = \int F(y)\,dy, \]

and, if all goes well, the integral $\int F(y)\,dy$ is ‘simpler’ than what we started
with, thereby reducing $\int f(x)\,dx$ to a simpler integral. The main challenge
is in identifying the functions $F$ and $p$ which express $f(x)$ as $F(p(x))\,p'(x)$.
As we will see below there are also some simple variations on this strategy.
For example, it may be easier to write $f(x)$ as a constant multiple of
$F(p(x))\,p'(x)$ or, in some cases, we can break up $f(x)$ into a sum of pieces,
each of which is easier to work out separately.


Consider

\[ \int (2x - 5)^{3/5}\,dx. \]

We substitute

\[ y = 2x - 5 \]

which gives

\[ dy = 2\,dx \quad \text{and so} \quad dx = \frac{1}{2}\,dy \]

and so

\[ \int (2x - 5)^{3/5}\,dx = \int y^{3/5}\,\frac{1}{2}\,dy = \frac{1}{2} \cdot \frac{y^{3/5 + 1}}{3/5 + 1} + C = \frac{5}{16} (2x - 5)^{8/5} + C, \]

where $C$ is an arbitrary constant.


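For

\[ \int (4 - 3x)^{2/7}\,dx \]

the substitution would be

\[ y = 4 - 3x \]

which leads to

\[ dy = -3\,dx \quad \text{and so} \quad dx = -\frac{1}{3}\,dy \]

which then transforms the given integral as follows:

\[ \begin{aligned}
\int (4 - 3x)^{2/7}\,dx &= \int y^{2/7} \left( -\frac{1}{3}\,dy \right) = -\frac{1}{3} \int y^{2/7}\,dy \\
 &= -\frac{1}{3} \cdot \frac{1}{\frac{2}{7} + 1}\, y^{\frac{2}{7} + 1} + C \\
 &= -\frac{7}{27} (4 - 3x)^{\frac{2}{7} + 1} + C,
\end{aligned} \]

where $C$ is an arbitrary constant.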
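Sometimes the integrand should be reworked a bit before or after the
substitution is made. For example, for

\[ \int x (3 - 4x)^{2/5}\,dx \]

we can substitute

\[ y = 3 - 4x \tag{32.1} \]

for which

\[ dy = -4\,dx \quad \text{so that} \quad dx = -\frac{1}{4}\,dy \]

and then the given integral has the form

\[ \int x\, y^{2/5} \left( -\frac{1}{4} \right) dy. \]

We need to replace the $x$ in the integrand with its expression in terms of $y$
by solving (32.1):

\[ x = \frac{1}{4} (3 - y). \]

Then our integral becomes

\[ \int \frac{1}{4} (3 - y)\, y^{2/5} \left( -\frac{1}{4} \right) dy = -\frac{1}{16} \int (3 - y)\, y^{2/5}\,dy. \]

The right side looks complicated but can be worked out by breaking up into
pieces:

\[ \begin{aligned}
\int (3 - y)\, y^{2/5}\,dy &= 3 \int y^{2/5}\,dy - \int y \cdot y^{2/5}\,dy \\
 &= 3\, \frac{y^{2/5 + 1}}{\frac{2}{5} + 1} - \frac{y^{7/5 + 1}}{\frac{7}{5} + 1} + \text{constant} \\
 &= \frac{15}{7}\, y^{7/5} - \frac{5}{12}\, y^{12/5} + \text{constant}.
\end{aligned} \]

Now putting everything together we have

\[ \int x (3 - 4x)^{2/5}\,dx = \left( -\frac{1}{16} \right) \frac{15}{7}\, y^{7/5} - \left( -\frac{1}{16} \right) \frac{5}{12}\, y^{12/5} + C, \]

where $C$ is an arbitrary constant and $y$ is as in (32.1). Thus

\[ \int x (3 - 4x)^{2/5}\,dx = -\frac{15}{112} (3 - 4x)^{7/5} + \frac{5}{192} (3 - 4x)^{12/5} + C, \]

for any arbitrary constant $C$.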
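
A quick way to guard against slips in a substitution is to differentiate the answer numerically and compare with the integrand. The sketch below does this for the last example, taking the integrand to be $x(3 - 4x)^{2/5}$ and restricting to points with $3 - 4x > 0$; the helper `central_difference` and the step size are choices made here, not part of the text.

```python
def integrand(x):
    """x * (3 - 4x)^(2/5), valid for 3 - 4x > 0."""
    return x * (3 - 4 * x) ** (2 / 5)

def antiderivative(x):
    """-(15/112)(3 - 4x)^(7/5) + (5/192)(3 - 4x)^(12/5)."""
    u = 3 - 4 * x
    return -(15 / 112) * u ** (7 / 5) + (5 / 192) * u ** (12 / 5)

def central_difference(g, x, h=1e-5):
    """Symmetric difference quotient approximating g'(x)."""
    return (g(x + h) - g(x - h)) / (2 * h)

for x in (-1.0, 0.0, 0.25, 0.5):
    # The two printed numbers should agree to several decimal places.
    print(x, central_difference(antiderivative, x), integrand(x))
```

The same check can be pointed at any of the other antiderivatives in this section by swapping in the corresponding pair of functions.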
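Moving on to more complicated integrands, consider

\[ \int (x^2 + 1)^{2/3}\, x\,dx. \]

Observe that $x\,dx$ is the same as $d(x^2 + 1)$, aside from a constant
multiple. The substitution is

\[ y = x^2 + 1. \]

With this we have

\[ dy = 2x\,dx \quad \text{and so} \quad x\,dx = \frac{1}{2}\,dy \]

which converts the given integral as follows:

\[ \int (x^2 + 1)^{2/3}\, x\,dx = \int y^{2/3}\,\frac{1}{2}\,dy = \frac{1}{2} \int y^{2/3}\,dy = \frac{1}{2} \cdot \frac{1}{\frac{2}{3} + 1}\, y^{\frac{2}{3} + 1} + C, \]

where $C$ is any constant. Rewriting in terms of $x$ we have

\[ \int (x^2 + 1)^{2/3}\, x\,dx = \frac{3}{10} (x^2 + 1)^{5/3} + C. \]

Here is a faster display of this method:

\[ \begin{aligned}
\int (x^2 + 1)^{2/3}\, x\,dx &= \int (x^2 + 1)^{2/3}\, \frac{1}{2}\, d(x^2 + 1) \\
 &= \frac{1}{2} \cdot \frac{1}{\frac{2}{3} + 1}\, (x^2 + 1)^{\frac{2}{3} + 1} + C \\
 &= \frac{3}{10} (x^2 + 1)^{5/3} + C.
\end{aligned} \]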

The same strategy works for more complicated integrands. For example,
for

\[ \int (3x^2 - 12x + 2)^{-7/3} (x - 2)\,dx \]

observe again that $(x - 2)\,dx$ is a constant multiple of $d(3x^2 - 12x + 2)$; so we
use

\[ y = 3x^2 - 12x + 2 \]

for which

\[ dy = (6x - 12)\,dx = 6(x - 2)\,dx \]

and this transforms the given integral as follows:

\[ \int (3x^2 - 12x + 2)^{-7/3} (x - 2)\,dx = \int y^{-7/3}\,\frac{1}{6}\,dy = \frac{1}{6} \int y^{-7/3}\,dy, \]

which integrates out to

\[ \int (3x^2 - 12x + 2)^{-7/3} (x - 2)\,dx = \frac{1}{6} \cdot \frac{1}{-\frac{7}{3} + 1}\, y^{-\frac{7}{3} + 1} + C = -\frac{1}{8} (3x^2 - 12x + 2)^{-4/3} + C, \]

for any arbitrary constant $C$.
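Sometimes the substitution is not as obvious as in the preceding
(manufactured) example. Consider

\[ \int \frac{x}{\sqrt{x^2 + 1}}\,dx. \]

Observe that the numerator is $x$, which is a constant times the derivative of
$x^2 + 1$. So we use

\[ u = x^2 + 1 \]

for which

\[ du = 2x\,dx \]

and so

\[ \int \frac{x}{\sqrt{x^2 + 1}}\,dx = \int \frac{\frac{1}{2}\,du}{\sqrt{u}} = \frac{1}{2} \cdot 2\sqrt{u} + C = \sqrt{u} + C, \]

where $C$ is an arbitrary constant; thus,

\[ \int \frac{x}{\sqrt{x^2 + 1}}\,dx = \sqrt{x^2 + 1} + C. \tag{32.2} \]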
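Similarly,

\[ \begin{aligned}
\int \frac{x}{\sqrt{3 - 2x^2}}\,dx &= \int \frac{-\frac{1}{4}\, d(3 - 2x^2)}{\sqrt{3 - 2x^2}} \\
 &= -\frac{1}{2} \int \frac{du}{2\sqrt{u}} \quad \text{with } u = 3 - 2x^2 \\
 &= -\frac{1}{2} \sqrt{u} + C \\
 &= -\frac{1}{2} \sqrt{3 - 2x^2} + C,
\end{aligned} \tag{32.3} \]

where $C$ is an arbitrary constant.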
Substitutions need not be limited to polynomials or algebraic functions.
For

\[ \int \frac{dx}{x \log x} \]

we observe the $x$ in the denominator and recall that $d \log x = dx/x$; so we
rewrite the integral as

\[ \int \frac{dx}{x \log x} = \int \frac{1}{\log x} \cdot \frac{dx}{x} = \int \frac{d(\log x)}{\log x} = \log(\log x) + C, \]

for any arbitrary constant $C$.
Similarly, 


dx d(log x) a l 7 
= = /( 3 d(l = | 1 
| wos Jawan [oces) (log x) =F 4 8") aang 


where C' is the arbitrary constant. 
Here is an example with trigonometric functions:

\[ \int \sin x \cos x\,dx = \int \sin x\,d(\sin x) = \frac{1}{2} \sin^2 x + C, \tag{32.4} \]

where $C$ is an arbitrary constant. We can do the same integral using the
substitution $\cos x$ instead:

\[ \int \sin x \cos x\,dx = \int \cos x \sin x\,dx = -\int \cos x\,d(\cos x) = -\frac{1}{2} \cos^2 x + C_1, \tag{32.5} \]

where $C_1$ is an arbitrary constant. For the first time we have now denoted
the arbitrary constant by $C_1$ instead of $C$ and this is the first time we face
a possible pitfall of the ambiguity in the meaning of the indefinite integral.
Observe that

\[ \sin^2 x = 1 - \cos^2 x \]

transforms the first expression (32.4) for $\int \sin x \cos x\,dx$ into

\[ \frac{1}{2} (1 - \cos^2 x) + C = -\frac{1}{2} \cos^2 x + \frac{1}{2} + C, \]

and this agrees with (32.5) on taking $C_1$ to be $C + 1/2$.
Here is a slightly more involved example of this type of substitution: 


1 
[sv wcosrde = [sw eds). = 7 sin’x + C, (32.6) 


where C is an arbitrary constant. 
We can take this a step beyond, with

\[ \int \sin^4 x \cos^7 x\,dx. \]

We write this as

\[ \int \sin^4 x \cos^6 x \cos x\,dx = \int \sin^4 x \cos^6 x\,d(\sin x). \]

Now we make a key observation: the 6-th power $\cos^6 x$ can be written as

\[ \cos^6 x = (\cos^2 x)^3 = (1 - \sin^2 x)^3. \]

This brings us to

\[ \int \sin^4 x \cos^7 x\,dx = \int \sin^4 x\,(1 - \sin^2 x)^3\,d(\sin x). \]

Now we can use the substitution

\[ y = \sin x \]

to transform the given integral to

\[ \int y^4 (1 - y^2)^3\,dy. \]

Here we have the integral of a polynomial and though it is a lengthy write-up
the integration is routine:

\[ \begin{aligned}
\int y^4 (1 - y^2)^3\,dy &= \int y^4 (1 - 3y^2 + 3y^4 - y^6)\,dy \\
 &= \int (y^4 - 3y^6 + 3y^8 - y^{10})\,dy \\
 &= \frac{1}{5} y^5 - \frac{3}{7} y^7 + \frac{3}{9} y^9 - \frac{1}{11} y^{11} + C,
\end{aligned} \]

where $C$ is the arbitrary constant; substituting back $\sin x$ for $y$ produces the
complete integral:

\[ \int \sin^4 x \cos^7 x\,dx = \frac{1}{5} \sin^5 x - \frac{3}{7} \sin^7 x + \frac{3}{9} \sin^9 x - \frac{1}{11} \sin^{11} x + C. \]

32.2 Some trigonometric integrals


As starting point we have the integrals

\[ \begin{aligned}
\int \sin x\,dx &= -\cos x + C_1, \\
\int \cos x\,dx &= \sin x + C_2,
\end{aligned} \tag{32.7} \]

where $C_1$ and $C_2$ are arbitrary constants.
Next up we have the simplest substitutions:

\[ \int \sin(3x)\,dx = \int \sin(3x)\,\frac{1}{3}\,d(3x) = \frac{1}{3} \int \sin(3x)\,d(3x) = -\frac{1}{3} \cos(3x) + C \]

and, similarly,

\[ \int \cos(5x)\,dx = \frac{1}{5} \sin(5x) + C', \]

with $C$ and $C'$ being arbitrary constants.
Our next objective is to work out integrals of the form

\[ \int \sin(Ax) \cos(Bx)\,dx \]

and other such integrals of products of trigonometric functions.
The key strategy is the use of the trigonometric sum formulas such as

\[ \begin{aligned}
\sin(a + b) &= \sin a \cos b + \sin b \cos a \\
\sin(a - b) &= \sin a \cos b - \sin b \cos a.
\end{aligned} \tag{32.8} \]

Adding these we get

\[ \sin(a + b) + \sin(a - b) = 2 \sin a \cos b, \]

and so

\[ \sin a \cos b = \frac{1}{2} \left[ \sin(a + b) + \sin(a - b) \right]. \tag{32.9} \]

Thus

\[ \sin(3x) \cos(7x) = \frac{1}{2} \left[ \sin(10x) + \sin(-4x) \right] = \frac{1}{2} \left[ \sin(10x) - \sin(4x) \right]. \]

Integrating, we obtain

\[ \int \sin(3x) \cos(7x)\,dx = \frac{1}{2} \left[ -\frac{1}{10} \cos(10x) + \frac{1}{4} \cos(4x) \right] + C, \]

where $C$ is any constant.
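
The product-to-sum step and the resulting antiderivative can be tested against a direct numerical integration. The sketch below uses a plain midpoint rule on $[0, 1]$; the interval, the number of subintervals and the function names are assumptions made for this illustration.

```python
import math

def midpoint_integral(f, a, b, n=20000):
    """Midpoint-rule approximation of the integral of f over [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

def integrand(x):
    return math.sin(3 * x) * math.cos(7 * x)

def antiderivative(x):
    """(1/2) [ -cos(10x)/10 + cos(4x)/4 ], from the sum formula (32.9)."""
    return 0.5 * (-math.cos(10 * x) / 10 + math.cos(4 * x) / 4)

numeric = midpoint_integral(integrand, 0.0, 1.0)
exact = antiderivative(1.0) - antiderivative(0.0)
print(numeric, exact)
```

The two printed numbers should agree to many decimal places if the signs in the antiderivative are right.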
The sum formula method works for other trigonometric products. For
these, recall

\[ \begin{aligned}
\cos(a + b) &= \cos a \cos b - \sin a \sin b \\
\cos(a - b) &= \cos a \cos b + \sin a \sin b.
\end{aligned} \tag{32.10} \]

Adding we get

\[ \cos(a + b) + \cos(a - b) = 2 \cos a \cos b, \]

and subtracting we get

\[ \cos(a - b) - \cos(a + b) = 2 \sin a \sin b. \]

Hence

\[ \begin{aligned}
\cos a \cos b &= \frac{1}{2} \left[ \cos(a + b) + \cos(a - b) \right] \\
\sin a \sin b &= \frac{1}{2} \left[ \cos(a - b) - \cos(a + b) \right].
\end{aligned} \tag{32.11} \]

Using these we have

\[ \begin{aligned}
\int \cos(3x) \cos(5x)\,dx &= \frac{1}{2} \int \left[ \cos(8x) + \cos(2x) \right] dx \quad \text{(note that $\cos(-2x) = \cos(2x)$)} \\
 &= \frac{1}{2} \left[ \frac{1}{8} \sin(8x) + \frac{1}{2} \sin(2x) \right] + C
\end{aligned} \tag{32.12} \]

for any constant $C$.
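Similarly,

\[ \begin{aligned}
\int \sin(2x) \sin(6x)\,dx &= \frac{1}{2} \int \left[ \cos(4x) - \cos(8x) \right] dx \quad \text{(note that $\cos(-4x) = \cos(4x)$)} \\
 &= \frac{1}{2} \left[ \frac{1}{4} \sin(4x) - \frac{1}{8} \sin(8x) \right] + C
\end{aligned} \tag{32.13} \]

for any constant $C$.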
We can apply this even to

\[ \int \sin^2 x\,dx, \]

viewing the integrand as the product $\sin x \sin x$:

\[ \sin^2 x = \sin x \sin x = \frac{1}{2} \left[ \cos(0) - \cos(2x) \right] = \frac{1}{2} \left[ 1 - \cos(2x) \right], \]

from which we have

\[ \begin{aligned}
\int \sin^2 x\,dx &= \frac{1}{2} \int \left[ 1 - \cos(2x) \right] dx \\
 &= \frac{1}{2} \left[ x - \frac{1}{2} \sin(2x) \right] + \text{constant} \\
 &= -\frac{1}{4} \sin(2x) + \frac{x}{2} + C,
\end{aligned} \tag{32.14} \]

where $C$ is an arbitrary constant. Using the formula

\[ \sin(2x) = 2 \sin x \cos x \]

we can rewrite the above integral also as:

\[ \int \sin^2 x\,dx = \frac{x}{2} - \frac{1}{2} \sin x \cos x + C. \]

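Similarly for $\cos^2 x$ we have

\[ \cos^2 x = \cos x \cos x = \frac{1}{2} \left[ \cos(0) + \cos(2x) \right] = \frac{1}{2} \left[ 1 + \cos(2x) \right], \]

which leads to

\[ \int \cos^2 x\,dx = \frac{1}{2} \left[ x + \frac{1}{2} \sin(2x) \right] + C. \tag{32.15} \]

We can use the sum-formula strategy multiple times. For example,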


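\[ \begin{aligned}
\sin(5x) \sin(3x) \cos(4x) &= \frac{1}{2} \left[ \cos(2x) - \cos(8x) \right] \cos(4x) \\
 &= \frac{1}{2} \left[ \cos(2x) \cos(4x) - \cos(8x) \cos(4x) \right] \\
 &= \frac{1}{2} \left[ \frac{1}{2} \left[ \cos(6x) + \cos(2x) \right] - \frac{1}{2} \left[ \cos(12x) + \cos(4x) \right] \right] \\
 &= \frac{1}{4} \left[ \cos(6x) + \cos(2x) - \cos(12x) - \cos(4x) \right]
\end{aligned} \tag{32.16} \]

from which we have

\[ \int \sin(5x) \sin(3x) \cos(4x)\,dx = \frac{1}{4} \left[ \frac{1}{6} \sin(6x) + \frac{1}{2} \sin(2x) - \frac{1}{12} \sin(12x) - \frac{1}{4} \sin(4x) \right] + C. \]


32.3 Summary of basic trigonometric integrals


Let us recall the following derivatives

\[ \begin{aligned}
\sin' x &= \cos x \\
\cos' x &= -\sin x \\
\tan' x &= \sec^2 x \\
\csc' x &= -\csc x \cot x \\
\sec' x &= \sec x \tan x \\
\cot' x &= -\csc^2 x.
\end{aligned} \tag{32.17} \]

These invert to give the following integrals:

\[ \begin{aligned}
\int \cos x\,dx &= \sin x + C \\
\int \sin x\,dx &= -\cos x + C \\
\int \sec^2 x\,dx &= \tan x + C \\
\int \csc x \cot x\,dx &= -\csc x + C \\
\int \sec x \tan x\,dx &= \sec x + C \\
\int \csc^2 x\,dx &= -\cot x + C.
\end{aligned} \tag{32.18} \]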

Some natural entries are missing from this list. For instance,

\[ \int \tan x\,dx. \]

This integral can be worked out by observing that in

\[ \int \tan x\,dx = \int \frac{\sin x}{\cos x}\,dx \]

the numerator $\sin x$ is minus the derivative of the denominator $\cos x$. So we
use the substitution

\[ y = \cos x \]

which gives

\[ dy = -\sin x\,dx \]

and so

\[ \int \tan x\,dx = \int \frac{-dy}{y} = -\int \frac{dy}{y} = -\log y + C, \]

and so

\[ \int \tan x\,dx = -\log \cos x + C. \]

Since

\[ -\log A = \log \frac{1}{A} \]

we can also write the integral as

\[ \int \tan x\,dx = \log(\sec x) + C. \tag{32.19} \]

If we are working over an interval on which $\sec x$ is negative we should use
the absolute value form:

\[ \int \tan x\,dx = \log|\sec x| + C. \]

(In complex analysis the correct integral is (32.19).)


The integral

\[ \int \sec x\,dx \]

is also of interest. There is no natural method for this integral that leads
to the value in its simplest form. The following tricky method is based on
having already happened upon the correct answer by accident:

\[ \int \sec x\,dx = \int \sec x\,\frac{\sec x + \tan x}{\sec x + \tan x}\,dx = \int \frac{\sec^2 x + \sec x \tan x}{\sec x + \tan x}\,dx \tag{32.20} \]

and here we make the remarkable observation that the numerator is the
derivative of the denominator:

\[ (\sec x + \tan x)' = \sec x \tan x + \sec^2 x = \sec x (\sec x + \tan x), \]

and so we use the substitution

\[ y = \sec x + \tan x. \]

Then, as we just observed,

\[ dy = (\sec x)\, y\,dx, \]

and so

\[ \sec x\,dx = \frac{dy}{y} \]

from which we have

\[ \int \sec x\,dx = \int \frac{dy}{y} = \log y + C. \]

Thus

\[ \int \sec x\,dx = \log(\sec x + \tan x) + C. \tag{32.21} \]

Again, we can use $\log|\cdots|$ if $\sec x + \tan x$ is negative.
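
Both logarithmic formulas, (32.19) and (32.21), can be spot-checked by differentiating them numerically at points of $(-\pi/2, \pi/2)$, where $\sec x$ and $\sec x + \tan x$ are positive. This is only a sketch; the helper names and the step size are choices made here.

```python
import math

def sec(x):
    return 1.0 / math.cos(x)

def tan_antiderivative(x):
    """log(sec x), the claimed antiderivative of tan x."""
    return math.log(sec(x))

def sec_antiderivative(x):
    """log(sec x + tan x), the claimed antiderivative of sec x."""
    return math.log(sec(x) + math.tan(x))

def central_difference(g, x, h=1e-5):
    """Symmetric difference quotient approximating g'(x)."""
    return (g(x + h) - g(x - h)) / (2 * h)

for x in (-1.0, 0.0, 0.5, 1.2):
    # Each derivative estimate is printed next to the function it should match.
    print(x,
          central_difference(tan_antiderivative, x), math.tan(x),
          central_difference(sec_antiderivative, x), sec(x))
```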


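32.4 Using trigonometric substitutions


For the integral

\[ \int \frac{dx}{\sqrt{1 - x^2}} \]

the best substitution is

\[ x = \sin\theta. \]

This means we are setting $\theta$ to be an inverse sin of $x$; for definiteness we can
set

\[ \theta = \sin^{-1}(x), \]

which restricts $\theta$ to $[-\pi/2, \pi/2]$. Then

\[ \begin{aligned}
\int \frac{dx}{\sqrt{1 - x^2}} &= \int \frac{\cos(\theta)\,d\theta}{\sqrt{1 - \sin^2\theta}} \quad \text{(which shows why we use $x = \sin\theta$)} \\
 &= \int \frac{\cos(\theta)\,d\theta}{\sqrt{\cos^2\theta}} \\
 &= \int \frac{\cos(\theta)\,d\theta}{\cos\theta} \quad \text{(here we use $\cos\theta \ge 0$ for $\theta \in [-\pi/2, \pi/2]$)} \\
 &= \int d\theta \\
 &= \theta + C \\
 &= \sin^{-1} x + C,
\end{aligned} \tag{32.22} \]

where $C$ is an arbitrary constant. We have seen before that

\[ \frac{d \sin^{-1}(x)}{dx} = \frac{1}{\sqrt{1 - x^2}}, \]

which confirms the integration result.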


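A similar integral is

\[ \int \frac{dx}{1 + x^2}. \]

Here the best substitution is

\[ x = \tan\theta \]

for which both

\[ dx = \sec^2\theta\,d\theta \]

and

\[ 1 + x^2 = 1 + \tan^2\theta = \sec^2\theta, \]

which simplifies the integral:

\[ \int \frac{dx}{1 + x^2} = \int \frac{\sec^2\theta\,d\theta}{\sec^2\theta} = \int d\theta = \theta + C = \tan^{-1} x + C, \]

for any arbitrary constant $C$.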
More involved is

\[ \int \sqrt{1 - x^2}\,dx. \]

When we substitute

\[ x = \sin\theta \]

and

\[ dx = \cos\theta\,d\theta \]

we obtain

\[ \int \sqrt{1 - x^2}\,dx = \int \sqrt{1 - \sin^2\theta}\,\cos\theta\,d\theta = \int \sqrt{\cos^2\theta}\,\cos\theta\,d\theta \tag{32.23} \]

and here again we have the annoying fact that (at least working with real
numbers)

\[ \sqrt{\cos^2\theta} = |\cos\theta|. \]

We get past this bit of unpleasantness by requiring that

\[ \theta = \sin^{-1} x \in [-\pi/2, \pi/2], \]

which ensures that

\[ \cos\theta \ge 0 \]

and so

\[ \sqrt{\cos^2\theta} = \cos\theta. \]

Returning to the integration (32.23) we have

\[ \int \sqrt{1 - x^2}\,dx = \int \cos\theta \cos\theta\,d\theta = \int \cos^2\theta\,d\theta. \]

Looking back at (32.15) we have then

\[ \int \sqrt{1 - x^2}\,dx = \frac{1}{2} \left[ \theta + \frac{1}{2} \sin(2\theta) \right] + C, \]

where $C$ is an arbitrary constant. We have to substitute back in $\theta = \sin^{-1} x$.
Before doing this observe that

\[ \sin(2\theta) = 2 \sin\theta \cos\theta \]

and so, since

\[ \sin\theta = x \quad \text{and} \quad \cos\theta = \sqrt{1 - x^2}, \]

we conclude that

\[ \int \sqrt{1 - x^2}\,dx = \frac{1}{2} \left[ \sin^{-1} x + x \sqrt{1 - x^2} \right] + C. \tag{32.24} \]
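Sometimes we need to do some algebra before using a trigonometric
substitution. Consider for example

\[ \int \sqrt{-4x^2 + 12x - 6}\,dx. \]

It is best to work the term inside the $\sqrt{\cdots}$ into a ‘completed squares’ form;
for convenience we work with the negative so that the coefficient of $x^2$ is
positive:

\[ \begin{aligned}
4x^2 - 12x + 6 &= (2x)^2 - 2 \cdot (2x) \cdot 3 + 3^2 - 3^2 + 6 \\
 &= (2x - 3)^2 + [6 - 3^2] \quad \text{(using $(A - B)^2 = A^2 - 2AB + B^2$)} \\
 &= (2x - 3)^2 - 3.
\end{aligned} \tag{32.25} \]

Thus

\[ -4x^2 + 12x - 6 = 3 - (2x - 3)^2, \]

and so

\[ \int \sqrt{-4x^2 + 12x - 6}\,dx = \int \sqrt{3 - (2x - 3)^2}\,dx. \]

Once we have this form we need to observe that this looks roughly like
(32.23). We could now use a sin substitution. Alternatively, we can make
the similarity with (32.23) greater by substituting

\[ 2x - 3 = \sqrt{3}\,y, \tag{32.26} \]

which ensures that

\[ -4x^2 + 12x - 6 = 3 - (2x - 3)^2 = 3 - 3y^2 = 3(1 - y^2), \tag{32.27} \]

and

\[ 2\,dx = \sqrt{3}\,dy. \]

So

\[ \int \sqrt{-4x^2 + 12x - 6}\,dx = \int \sqrt{3(1 - y^2)}\,\frac{\sqrt{3}}{2}\,dy = \frac{3}{2} \int \sqrt{1 - y^2}\,dy. \]

Using (32.24) we have then

\[ \int \sqrt{-4x^2 + 12x - 6}\,dx = \frac{3}{2} \cdot \frac{1}{2} \left[ \sin^{-1} y + y \sqrt{1 - y^2} \right] + C. \]

Substituting in the value of $y$ from (32.26) yields the complete answer. The
algebra can be made nicer by observing that

\[ \begin{aligned}
\frac{3}{2} \cdot \frac{1}{2} \left[ \sin^{-1} y + y \sqrt{1 - y^2} \right] &= \frac{1}{4} \left[ 3 \sin^{-1} y + 3y \sqrt{1 - y^2} \right] \\
 &= \frac{1}{4} \left[ 3 \sin^{-1} y + \sqrt{3}\,y \sqrt{3(1 - y^2)} \right] \\
 &= \frac{1}{4} \left[ 3 \sin^{-1}\!\left( \frac{2x - 3}{\sqrt{3}} \right) + (2x - 3) \sqrt{-4x^2 + 12x - 6} \right]
\end{aligned} \tag{32.28} \]

where in the final step we used (32.27) and the value of $y$ from (32.26).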
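
Since (32.28) involves several substitutions, a numerical comparison is a useful safeguard. The sketch below integrates $\sqrt{-4x^2 + 12x - 6}$ over $[1, 2]$, an interval chosen here because the expression under the root stays positive on it, and compares with the difference of the closed form at the endpoints. Function names are illustrative.

```python
import math

def integrand(x):
    return math.sqrt(-4 * x * x + 12 * x - 6)

def antiderivative(x):
    """(1/4) [ 3 asin((2x-3)/sqrt(3)) + (2x-3) sqrt(-4x^2+12x-6) ], as in (32.28)."""
    w = 2 * x - 3
    return 0.25 * (3 * math.asin(w / math.sqrt(3)) + w * integrand(x))

def midpoint_integral(f, a, b, n=20000):
    """Midpoint-rule approximation of the integral of f over [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

numeric = midpoint_integral(integrand, 1.0, 2.0)
exact = antiderivative(2.0) - antiderivative(1.0)
print(numeric, exact)
```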


32.5 Integration by parts


The product rule for derivatives is

\[ d(UV) = U\,dV + V\,dU, \]

and so

\[ \int d(UV) = \int U\,dV + \int V\,dU, \]

provided we line up the ‘arbitrary’ constants appropriately. Then

\[ \int U\,dV = UV - \int V\,dU. \]

This often helps in simplifying integrals, and is called the integration by parts
formula or method.
For example,

\[ \begin{aligned}
\int \log x\,dx &= (\log x)\,x - \int x\,d(\log x) \\
 &= x \log x - \int x\,\frac{1}{x}\,dx \\
 &= x \log x - \int dx \\
 &= x \log x - x + C.
\end{aligned} \tag{32.29} \]

Thus,

\[ \int \log x\,dx = x \log x - x + C, \tag{32.30} \]

where $C$ is an arbitrary constant.
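Sometimes the choice of $U$ and $V$ requires some planning ahead:

\[ \begin{aligned}
\int x \log x\,dx &= \frac{1}{2} \int (\log x)\,d(x^2) \\
 &= \frac{1}{2} (\log x)\,x^2 - \frac{1}{2} \int x^2\,d(\log x) \\
 &= \frac{1}{2} (\log x)\,x^2 - \frac{1}{2} \int x^2\,\frac{dx}{x} \\
 &= \frac{1}{2} (\log x)\,x^2 - \frac{1}{2} \int x\,dx \\
 &= \frac{1}{2} x^2 \log x - \frac{1}{4} x^2 + C,
\end{aligned} \tag{32.31} \]

where $C$ is an arbitrary constant.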
We can apply this method to

\[ \int x \sin x\,dx \]

as follows:

\[ \begin{aligned}
\int x \sin x\,dx &= -\int x\,d(\cos x) \\
 &= -\left[ x \cos x - \int \cos x\,dx \right] \\
 &= -x \cos x + \int \cos x\,dx \\
 &= -x \cos x + \sin x + C,
\end{aligned} \tag{32.32} \]

where $C$ is an arbitrary constant.
Here is a considerably more involved use of integration by parts:

\[ \begin{aligned}
\int \sec^3 x\,dx &= \int \sec x \sec^2 x\,dx \\
 &= \int \sec x\,d(\tan x) \\
 &= \sec x \tan x - \int \tan x\,d(\sec x) \\
 &= \sec x \tan x - \int \sec x \tan^2 x\,dx \\
 &= \sec x \tan x - \int \sec x (\sec^2 x - 1)\,dx \quad \text{(using $\sec^2 x = \tan^2 x + 1$)} \\
 &= \sec x \tan x - \int \left[ \sec^3 x - \sec x \right] dx \\
 &= \sec x \tan x - \int \sec^3 x\,dx + \int \sec x\,dx
\end{aligned} \tag{32.33} \]

and at this stage it looks bad at first because we have ended up with $\int \sec^3 x\,dx$
again, on the right. But we are saved because there is a minus sign in front
of the integral on the right; we can move it to the left to get

\[ 2 \int \sec^3 x\,dx = \sec x \tan x + \int \sec x\,dx = \sec x \tan x + \log(\sec x + \tan x) + \text{constant}, \]

on using formula (32.21) for $\int \sec x\,dx$. Hence,

\[ \int \sec^3 x\,dx = \frac{1}{2} \left[ \sec x \tan x + \log(\sec x + \tan x) \right] + C, \tag{32.34} \]

where $C$ is an arbitrary constant. As usual, a real solution is also obtained if
$\sec x + \tan x$ on the right is negative by replacing this with its absolute value.


One other example of this type is

\[ \int e^{Ax} \sin(Bx)\,dx \]

where $A$ and $B$ are non-zero constants. Then we have

\[ \begin{aligned}
\int e^{Ax} \sin(Bx)\,dx &= -\frac{1}{B} \int e^{Ax}\,d(\cos(Bx)) \\
 &= -\frac{1}{B} \left[ e^{Ax} \cos(Bx) - \int \cos(Bx)\,d(e^{Ax}) \right] \\
 &= -\frac{1}{B} \left[ e^{Ax} \cos(Bx) - \int \cos(Bx)\,A e^{Ax}\,dx \right] \\
 &= -\frac{1}{B} e^{Ax} \cos(Bx) + \frac{A}{B} \int \cos(Bx)\,e^{Ax}\,dx.
\end{aligned} \tag{32.35} \]

Now we run the same method on the integral $\int \cos(Bx)\,e^{Ax}\,dx$:

\[ \begin{aligned}
\int e^{Ax} \sin(Bx)\,dx &= -\frac{1}{B} e^{Ax} \cos(Bx) + \frac{A}{B} \int e^{Ax}\,\frac{1}{B}\,d\sin(Bx) \\
 &= -\frac{1}{B} e^{Ax} \cos(Bx) + \frac{A}{B^2} \int e^{Ax}\,d\sin(Bx) \\
 &= -\frac{1}{B} e^{Ax} \cos(Bx) + \frac{A}{B^2} \left[ e^{Ax} \sin(Bx) - \int \sin(Bx)\,d(e^{Ax}) \right] \\
 &= -\frac{1}{B} e^{Ax} \cos(Bx) + \frac{A}{B^2} \left[ e^{Ax} \sin(Bx) - \int \sin(Bx)\,A e^{Ax}\,dx \right] \\
 &= -\frac{1}{B} e^{Ax} \cos(Bx) + \frac{A}{B^2} e^{Ax} \sin(Bx) - \frac{A^2}{B^2} \int e^{Ax} \sin(Bx)\,dx,
\end{aligned} \tag{32.36} \]

with our original integral reappearing on the right side, with a negative sign.
Keeping in mind that the integrals on the two sides might differ by a constant
we have:

\[ \left( 1 + \frac{A^2}{B^2} \right) \int e^{Ax} \sin(Bx)\,dx = -\frac{1}{B} e^{Ax} \cos(Bx) + \frac{A}{B^2} e^{Ax} \sin(Bx) + \text{constant}. \]

Multiplying both sides by $B^2$ gives

\[ \begin{aligned}
(A^2 + B^2) \int e^{Ax} \sin(Bx)\,dx &= -B e^{Ax} \cos(Bx) + A e^{Ax} \sin(Bx) + \text{constant} \\
 &= e^{Ax} \left[ A \sin(Bx) - B \cos(Bx) \right] + \text{constant}.
\end{aligned} \tag{32.37} \]

Dividing by $A^2 + B^2$ produces, at last, the formula

\[ \int e^{Ax} \sin(Bx)\,dx = e^{Ax}\,\frac{A \sin(Bx) - B \cos(Bx)}{A^2 + B^2} + C, \tag{32.38} \]

where $C$ is an arbitrary constant. We assumed $A$ and $B$ are both nonzero.
We can check easily that the formula works even if one of these two values
is 0.
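
Formula (32.38), including the cases where one of $A$, $B$ is zero, can be spot-checked by numerical differentiation. The sketch below is an illustration; the sample values of $A$, $B$ and $x$ are arbitrary choices made here.

```python
import math

def integrand(x, A, B):
    return math.exp(A * x) * math.sin(B * x)

def antiderivative(x, A, B):
    """e^{Ax} (A sin(Bx) - B cos(Bx)) / (A^2 + B^2), as in (32.38)."""
    return math.exp(A * x) * (A * math.sin(B * x) - B * math.cos(B * x)) / (A * A + B * B)

def central_difference(g, x, h=1e-5):
    """Symmetric difference quotient approximating g'(x)."""
    return (g(x + h) - g(x - h)) / (2 * h)

x0 = 0.3
for A, B in ((2.0, 3.0), (-1.0, 0.5), (0.0, 2.0), (1.5, 0.0)):
    slope = central_difference(lambda x: antiderivative(x, A, B), x0)
    # The estimated derivative should match the integrand at x0.
    print(A, B, slope, integrand(x0, A, B))
```

The formula breaks down only when $A$ and $B$ are both zero, where the denominator $A^2 + B^2$ vanishes.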


Exercises on Integration by Substitution 
1. Work out the following integrals using substitutions: 
(a) [(4—- 32)? dr 
(b) f V2+5ardzx 


C 


) 

(c) f aged 

(d) fa ae hs 
) 
) 
) 


e 


( 
2x¢+1 

(f J Jeanie * 

(g) few? Pad 

(h f VEE) 4/ log(2x+5) We 


22+5 


, 1 
(i aan 


ss It 


Chapter 33 


Paths and Length 


33.1 Paths 


A path $c$ in the plane $\mathbb{R}^2$ is a mapping

\[ c : I \to \mathbb{R}^2 : t \mapsto c(t) = (xc(t), yc(t)), \tag{33.1} \]

where $I$ is some interval in $\mathbb{R}$. We can think of $c(t)$ as being the position of
a point at time $t$.
In (33.1) we are denoting the $x$-coordinate of a point $p$ by $x(p)$:

\[ x(p) = \text{$x$-coordinate of a point $p$}, \]

so that the $x$-coordinate of $c(t)$ is $x(c(t))$, which we write briefly as

\[ xc(t). \]

Similarly, the $y$-coordinate of a point $p$ is

\[ y(p) = \text{$y$-coordinate of a point $p$}, \]

and the $y$-coordinate of $c(t)$ is $y(c(t))$, which we write briefly as $yc(t)$.
As our first example, consider

\[ c(t) = (t, 2t + 1) \quad \text{for } t \in \mathbb{R}. \]

Think of this as a moving point, whose position at clock time $t$ is $(t, 2t + 1)$;
see Figure 33.1.

Figure 33.1: The path $c : \mathbb{R} \to \mathbb{R}^2 : t \mapsto (t, 2t + 1)$


This is a point traveling at a uniform speed along a straight line. How fast
is it traveling? We can check how fast the $x$ and $y$ coordinates are changing:

\[ c'(t) = \left( (xc)'(t), (yc)'(t) \right) = \left( \frac{dt}{dt}, \frac{d(2t + 1)}{dt} \right) = (1, 2). \]

This is called the velocity of the path $c$ at time $t$. Note that for this path the
velocity is the same, being $(1, 2)$, at all times $t$.

Here is a different path that also travels along the same line, but with
increasing speed:

\[ \mathbb{R} \to \mathbb{R}^2 : t \mapsto c(t) = (t^2, 2t^2 + 1). \]

This is displayed in Figure 33.2.

Figure 33.2: The path $c : \mathbb{R} \to \mathbb{R}^2 : t \mapsto (t^2, 2t^2 + 1)$, with the point $c(0.5) = (0.25, 1.5)$ marked

The velocity of this path at time $t$ is

\[ c'(t) = (2t, 4t), \]

which is clearly changing as the clock time $t$ changes.
The path

\[ [0, 2\pi] \to \mathbb{R}^2 : t \mapsto (\cos t, \sin t) \]

traces out the unit circle counterclockwise exactly once, starting out at

\[ (\cos 0, \sin 0) = (1, 0) \]

and ending also at

\[ (\cos 2\pi, \sin 2\pi) = (1, 0). \]

Figure 33.3: The path $[0, 2\pi] \to \mathbb{R}^2 : t \mapsto (\cos t, \sin t)$, with the point $c(0) = (1, 0)$ marked


The velocity at time $t$ is:

\[ c'(t) = (-\sin t, \cos t). \]

In this example we have a different interpretation of t also possible: t is 
simply the measure of the angle between the x-axis and the line from the 
origin to the location of the point. 

A path $c$ is said to be continuous if its $x$ and $y$ components $xc$ and $yc$ are
continuous. It is said to be differentiable at $t$ if its $x$ and $y$ components are
differentiable at $t$. The velocity of $c$ at time $t$ is

\[ c'(t) = \left( (xc)'(t), (yc)'(t) \right). \tag{33.2} \]

The speed of the path $c$ at time $t$ is defined to be

\[ |c'(t)| = \sqrt{(xc)'(t)^2 + (yc)'(t)^2}. \tag{33.3} \]


33.2 Lengths of paths


Consider a path

\[ c : [a, b] \to \mathbb{R}^2, \]

where $a, b \in \mathbb{R}$ and $a < b$. Let

\[ P = \{t_0, t_1, \ldots, t_N\} \]

be a partition of $[a, b]$, with

\[ a = t_0 < t_1 < \cdots < t_N = b. \]

We think of these as time instants when we ‘observe’ the path and note
where it is. Suppose we approximate the path $c$ by a path that travels in
a straight line from the initial point $c(t_0)$ to the next point $c(t_1)$, and then
straight to $c(t_2)$, and in this way till $c(t_N) = c(b)$. The length of this polygonal
approximation is

\[ l(c; P) = d(c(t_0), c(t_1)) + d(c(t_1), c(t_2)) + \cdots + d(c(t_{N-1}), c(t_N)), \tag{33.4} \]

where $d(P, Q)$ means the usual distance between points $P$ and $Q$:

\[ d(P, Q) = \sqrt{(x_Q - x_P)^2 + (y_Q - y_P)^2}, \tag{33.5} \]

where

\[ P = (x_P, y_P) \quad \text{and} \quad Q = (x_Q, y_Q). \]
If we refine the partition $P$ by adding one more point, say $s$, which lies
between $t_{j-1}$ and $t_j$, then for the new partition $P_1$ we have a corresponding
length

\[ l(c; P_1). \]

The difference between this and $l(c; P)$ is clearly

\[ l(c; P_1) - l(c; P) = d(c(t_{j-1}), c(s)) + d(c(s), c(t_j)) - d(c(t_{j-1}), c(t_j)), \]

and this is $\ge 0$ because of the triangle inequality for distances:

\[ d(P, S) + d(S, Q) \ge d(P, Q), \]

for any points $P$, $S$ and $Q$.
Thus adding points to a partition cannot decrease the length of the
corresponding polygonal approximation. So if $P$ and $P'$ are partitions of $[a, b]$ with

\[ P \subset P', \]

we have

\[ l(c; P) \le l(c; P'). \tag{33.6} \]

If we keep on adding points to a partition, making it finer and finer, intuitively
it is clear that the lengths of the polygonal approximations should approach
the ‘length’ of the path $c$ itself. Hence we define the length of $c$ to be

\[ l(c) = \sup_{\text{partitions $P$ of $[a,b]$}} l(c; P). \tag{33.7} \]

If $c$ has a continuous derivative then we have a clean and simple formula
for the length of $c$:

\[ l(c) = \int_a^b \sqrt{(xc)'(t)^2 + (yc)'(t)^2}\,dt. \tag{33.8} \]

Let us apply this to the circle

\[ c(t) = (\cos t, \sin t), \quad t \in [0, 2\pi]. \]

We have

\[ l(c) = \int_0^{2\pi} \sqrt{(-\sin t)^2 + (\cos t)^2}\,dt = \int_0^{2\pi} 1\,dt = 2\pi, \]

confirming that the circumference of the unit circle is $2\pi$.
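
The definition (33.7) can be watched in action for the circle: refining a partition by repeated halving gives polygonal lengths that increase toward $2\pi$. The sketch below is illustrative; the uniform partitions and the function names are choices made here, and `math.dist` requires Python 3.8 or later.

```python
import math

def polygonal_length(c, partition):
    """l(c; P): sum of straight-line distances between consecutive observed points."""
    points = [c(t) for t in partition]
    return sum(math.dist(p, q) for p, q in zip(points, points[1:]))

def uniform_partition(a, b, N):
    """The partition {t_0, ..., t_N} of [a, b] into N equal pieces."""
    return [a + (b - a) * k / N for k in range(N + 1)]

def circle(t):
    return (math.cos(t), math.sin(t))

N = 4
while N <= 1024:
    # Doubling N refines the previous partition, so the length cannot decrease.
    print(N, polygonal_length(circle, uniform_partition(0.0, 2 * math.pi, N)))
    N *= 2
print("2*pi =", 2 * math.pi)
```

Each polygon here is a regular $N$-gon inscribed in the unit circle, so its perimeter stays below $2\pi$ while approaching it.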


33.3 Paths and Curves 


The notion of a curve is natural but there are some choices available in
defining what it is precisely. Briefly, a curve is a path but without worrying
about the specific speed at which the path moves. Thus,

\[ \mathbb{R} \to \mathbb{R}^2 : t \mapsto (t + 1, \sin(t + 1)) \]

and

\[ \mathbb{R} \to \mathbb{R}^2 : t \mapsto (t, \sin t) \]

are the same curve, the only difference between them being that the first
path is always a bit ‘ahead’ of the second one.
On the other hand, the paths

\[ \mathbb{R} \to \mathbb{R}^2 : t \mapsto (t, 2t) \]

and

\[ \mathbb{R} \to \mathbb{R}^2 : t \mapsto (t^2, 2t^2) \]

don’t correspond to the same curve because the second path always has
$x$-coordinate nonnegative whereas the first one has negative values of $x$.
We say that a path

\[ c_1 : [a, b] \to \mathbb{R}^2 \]

is a reparametrization of a path

\[ c_2 : [a', b'] \to \mathbb{R}^2 \]

if

\[ c_2(t) = c_1(\phi(t)) \quad \text{for all } t \in [a', b'], \]

for some clock-changing mapping

\[ \phi : [a', b'] \to [a, b] : t \mapsto \phi(t) \]

that is continuous and satisfies

\[ \phi(a') = a \quad \text{and} \quad \phi(b') = b, \]

and

\[ \phi(p) < \phi(q) \quad \text{if } p < q \]

for all $p, q \in [a', b']$. Thus the path $c_2$ is at time $t$ where $c_1$ is at time $\phi(t)$.
We can say that two paths which are reparametrizations of each other
correspond to the same curve. Alternatively, a curve is a path along with all
of its reparametrizations.
Here is an important observation: 


Proposition 33.3.1 If a path $c_1$ is a reparametrization of a path $c_2$ then
they have the same lengths:

\[ l(c_1) = l(c_2). \]

Thus we can speak of the ‘length of a curve’. This is also called arc length.


There is some flexibility in defining curves. One could insist, for example,
that only differentiable reparametrizations be allowed.


33.4 Lengths for graphs


Consider now a function

\[ f : [a, b] \to \mathbb{R} \]

where $a, b \in \mathbb{R}$ with $a < b$. There is a corresponding natural path

\[ [a, b] \to \mathbb{R}^2 : x \mapsto (x, f(x)) \]

which traces out the graph of $f$ ‘from left to right’. The length of this path
is

\[ \int_a^b \sqrt{1 + f'(x)^2}\,dx. \tag{33.9} \]

Here is an example. The graph of

\[ x^{2/3} + y^{2/3} = 1 \tag{33.10} \]

has four parts, one in the positive quadrant and the others obtained by
reflections across the two axes. We will work out the length of the full curve
(a path that goes around the graph). This length is

\[ 4 \int_0^1 \sqrt{1 + (y')^2}\,dx. \]

Now from the equation (33.10) we have on taking $d/dx$ of both sides:

\[ \frac{2}{3} x^{-1/3} + \frac{2}{3} y^{-1/3} y' = 0, \]

and so, on doing the algebra, we have

\[ y' = -\frac{y^{1/3}}{x^{1/3}}. \]

Then

\[ 1 + (y')^2 = 1 + \frac{y^{2/3}}{x^{2/3}} = \frac{x^{2/3} + y^{2/3}}{x^{2/3}} = \frac{1}{x^{2/3}}, \]

and so

\[ \int_0^1 \sqrt{1 + (y')^2}\,dx = \int_0^1 \sqrt{\frac{1}{x^{2/3}}}\,dx = \int_0^1 x^{-1/3}\,dx = \frac{x^{-1/3 + 1}}{-\frac{1}{3} + 1} \Big|_0^1 = \frac{3}{2}. \]

So the length of the full curve is

\[ 4 \cdot \frac{3}{2} = 6. \]


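Next consider the curve
$$y = x^2, \qquad x \in [0,a],$$
for any $a > 0$. See Figure 33.4.

Figure 33.4: Arc length for $y = x^2$

The length is
$$\int_0^a \sqrt{1 + (y')^2}\,dx = \int_0^a \sqrt{1 + 4x^2}\,dx.$$
Recall that
$$\int \sqrt{1 + w^2}\,dw = \frac{1}{2}\left[w\sqrt{1 + w^2} + \log\left(w + \sqrt{1 + w^2}\right)\right] + C,$$
where $C$ is an arbitrary constant. Using
$$w = 2x$$
then gives
$$\int \sqrt{1 + 4x^2}\,2\,dx = \frac{1}{2}\left[2x\sqrt{1 + 4x^2} + \log\left(2x + \sqrt{1 + 4x^2}\right)\right] + C,$$
and so, dividing by 2, we have
$$\int \sqrt{1 + 4x^2}\,dx = \frac{1}{4}\left[2x\sqrt{1 + 4x^2} + \log\left(2x + \sqrt{1 + 4x^2}\right)\right] + C',$$
for an arbitrary constant $C'$. Hence
$$\int_0^a \sqrt{1 + 4x^2}\,dx = \frac{1}{4}\left[2a\sqrt{1 + 4a^2} + \log\left(2a + \sqrt{1 + 4a^2}\right)\right].$$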
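
The closed form can be compared with a direct numerical integration. The sketch below uses a composite Simpson rule; the value `a = 1.5` and the function names are arbitrary choices made for illustration.

```python
import math

def closed_form(a):
    # (1/4) [ 2a sqrt(1 + 4a^2) + log(2a + sqrt(1 + 4a^2)) ]
    root = math.sqrt(1.0 + 4.0 * a * a)
    return 0.25 * (2.0 * a * root + math.log(2.0 * a + root))

def simpson(f, lo, hi, n=1000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * f(lo + k * h)
    return total * h / 3.0

a = 1.5  # hypothetical right endpoint
numeric = simpson(lambda x: math.sqrt(1.0 + 4.0 * x * x), 0.0, a)
print(closed_form(a), numeric)
```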


300 Ambar N. Sengupta 11/6/2011 


Chapter 34 


Selected Solutions 


Solutions for Exercise Set 


1. $\lim_{x\to 1} 5 = 5$ because the sup’s and inf’s of the constant function 5 are both 5 on any neighborhood of 1.


2. lim,_,1(a? + 4x — 2) SP pAad 2 = 0, by the rules for limits. 


3 ies SS Se) at 


2-3 ~=6©1-3 
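4. $\lim_{x\to 1} \frac{x^3-1}{x^2-1}$
$$\lim_{x\to 1}\frac{x^3-1}{x^2-1} = \lim_{x\to 1}\frac{(x-1)(x^2+x+1)}{(x-1)(x+1)} = \lim_{x\to 1}\frac{x^2+x+1}{x+1} = 3/2.$$

5.
$$\lim_{x\to 1}\frac{x^4-1}{x^2-1} = \lim_{x\to 1}\frac{(x-1)(x^3+x^2+x+1)}{(x-1)(x+1)} = \lim_{x\to 1}\frac{x^3+x^2+x+1}{x+1} = 4/2 = 2.$$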


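7. $\lim_{x\to\infty} \frac{4x^2-3x+2}{x^2-x+1}$
$$\begin{aligned}
\lim_{x\to\infty}\frac{4x^2-3x+2}{x^2-x+1} &= \lim_{x\to\infty}\frac{x^2\left(4-\frac{3}{x}+\frac{2}{x^2}\right)}{x^2\left(1-\frac{1}{x}+\frac{1}{x^2}\right)}\\
&= \lim_{x\to\infty}\frac{4-\frac{3}{x}+\frac{2}{x^2}}{1-\frac{1}{x}+\frac{1}{x^2}}\\
&= \frac{4-0+0}{1-0+0} = 4.
\end{aligned}$$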
8. 
_ Se®—Te+2 | 265-4434) 
lim — —W——-. = _ lim 7 ; 
~—00 376 +742 ~—00 a(34+ 4+ +) 
—~ 74 2 
= lim ae Se 
zoo 3+ 4 a 
5—-0+0 
= = 5/3. 
3+0+0 / 
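9.
$$\begin{aligned}
\lim_{x\to\infty}\frac{4x^3+\sin x}{2x^3+\sqrt{x}} &= \lim_{x\to\infty}\frac{x^3\left(4+\frac{\sin x}{x^3}\right)}{x^3\left(2+\frac{\sqrt{x}}{x^3}\right)}\\
&= \lim_{x\to\infty}\frac{4+\frac{\sin x}{x^3}}{2+\frac{\sqrt{x}}{x^3}}\\
&= \frac{4+0}{2+0} = 2,
\end{aligned}$$
where we used $(\sin x)/x^3 \to 0$ as $x \to \infty$ by using, for example, the ‘squeeze theorem’:
$$0 \le \left|\frac{\sin x}{x^3}\right| \le \frac{1}{x^3} \to 0 \quad\text{as } x \to \infty,$$
and also $\sqrt{x}/x^3 = 1/x^{5/2} \to 0$ as $x \to \infty$.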
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10. 
Tx +ae+cos(e?)  , «(7+ 4+ 82) 
1m = wm 


1 cos x2 
og at + fo 


see ae arene 
_74+04+0 7 
2-040 2’ 


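11. $\lim_{x\to\infty}\left[\sqrt{x+1}-\sqrt{x}\right]$

Sol: At first this looks like $\infty - \infty$ and that’s bad. In order to determine whether one of the two terms wins out over the other we need some trick. Here is a method that works often when square-roots are involved:
$$\begin{aligned}
\lim_{x\to\infty}\left[\sqrt{x+1}-\sqrt{x}\right] &= \lim_{x\to\infty}\frac{\left[\sqrt{x+1}-\sqrt{x}\right]\left[\sqrt{x+1}+\sqrt{x}\right]}{\sqrt{x+1}+\sqrt{x}}\\
&= \lim_{x\to\infty}\frac{(x+1)-x}{\sqrt{x+1}+\sqrt{x}} \qquad\text{(using } (A-B)(A+B) = A^2-B^2\text{)}\\
&= \lim_{x\to\infty}\frac{1}{\sqrt{x+1}+\sqrt{x}} = 0.
\end{aligned}$$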
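12.
$$\begin{aligned}
\lim_{x\to\infty}\left[\sqrt{3x^2+1}-\sqrt{x^2+1}\right] &= \lim_{x\to\infty}\frac{\left[\sqrt{3x^2+1}-\sqrt{x^2+1}\right]\left[\sqrt{3x^2+1}+\sqrt{x^2+1}\right]}{\sqrt{3x^2+1}+\sqrt{x^2+1}}\\
&= \lim_{x\to\infty}\frac{3x^2+1-(x^2+1)}{\sqrt{3x^2+1}+\sqrt{x^2+1}} \qquad\text{(using } (A-B)(A+B) = A^2-B^2\text{)}\\
&= \lim_{x\to\infty}\frac{2x^2}{\sqrt{x^2(3+1/x^2)}+\sqrt{x^2(1+1/x^2)}}\\
&= \lim_{x\to\infty}\frac{2x^2}{x\sqrt{3+1/x^2}+x\sqrt{1+1/x^2}}\\
&= \lim_{x\to\infty}\frac{2x^2}{x\left[\sqrt{3+1/x^2}+\sqrt{1+1/x^2}\right]}\\
&= \lim_{x\to\infty}\frac{2x}{\sqrt{3+1/x^2}+\sqrt{1+1/x^2}}\\
&= \frac{\infty}{\sqrt{3}+1}\\
&= \infty.
\end{aligned}$$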


13. $\lim_{x\to\infty}\left[\sqrt{4x^4+2}-\sqrt{x^4+1}\right]$

Use the same method as for the previous problem and reach
$$\begin{aligned}
\lim_{x\to\infty}\left[\sqrt{4x^4+2}-\sqrt{x^4+1}\right] &= \lim_{x\to\infty}\frac{3x^4+1}{\sqrt{4x^4+2}+\sqrt{x^4+1}}\\
&= \lim_{x\to\infty}\frac{x^4(3+1/x^4)}{x^2\left[\sqrt{4+2/x^4}+\sqrt{1+1/x^4}\right]}\\
&= \lim_{x\to\infty}\frac{x^2(3+1/x^4)}{\sqrt{4+2/x^4}+\sqrt{1+1/x^4}}\\
&= \frac{\infty(3+0)}{\sqrt{4}+\sqrt{1}} = \infty.
\end{aligned}$$

14. $\lim_{x\to\infty}\frac{\sqrt{x+1}}{\sqrt{x}} = \lim_{x\to\infty}\sqrt{\frac{x+1}{x}} = \lim_{x\to\infty}\sqrt{1+\frac{1}{x}} = \sqrt{1+0} = 1.$


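15. $\lim_{x\to\infty} x\left[\sqrt{x^2+2}-\sqrt{x^2+1}\right]$
$$\begin{aligned}
\lim_{x\to\infty} x\left[\sqrt{x^2+2}-\sqrt{x^2+1}\right] &= \lim_{x\to\infty} x\,\frac{\left[\sqrt{x^2+2}-\sqrt{x^2+1}\right]\left[\sqrt{x^2+2}+\sqrt{x^2+1}\right]}{\sqrt{x^2+2}+\sqrt{x^2+1}}\\
&= \lim_{x\to\infty} x\,\frac{(x^2+2)-(x^2+1)}{\sqrt{x^2+2}+\sqrt{x^2+1}}\\
&= \lim_{x\to\infty} x\,\frac{1}{\sqrt{x^2(1+2/x^2)}+\sqrt{x^2(1+1/x^2)}}\\
&= \lim_{x\to\infty} x\,\frac{1}{x\left[\sqrt{1+2/x^2}+\sqrt{1+1/x^2}\right]}\\
&= \lim_{x\to\infty}\frac{1}{\sqrt{1+2/x^2}+\sqrt{1+1/x^2}}\\
&= \frac{1}{1+1} = \frac{1}{2}.
\end{aligned}$$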
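16.
$$\begin{aligned}
\lim_{x\to\infty}\sqrt{x+2}\left[\sqrt{x+1}-\sqrt{x}\right] &= \lim_{x\to\infty}\sqrt{x+2}\,\frac{\left[\sqrt{x+1}-\sqrt{x}\right]\left[\sqrt{x+1}+\sqrt{x}\right]}{\sqrt{x+1}+\sqrt{x}}\\
&= \lim_{x\to\infty}\sqrt{x+2}\,\frac{x+1-x}{\sqrt{x+1}+\sqrt{x}}\\
&= \lim_{x\to\infty}\sqrt{x+2}\,\frac{1}{\sqrt{x+1}+\sqrt{x}}\\
&= \lim_{x\to\infty}\sqrt{x(1+2/x)}\,\frac{1}{\sqrt{x(1+1/x)}+\sqrt{x}}\\
&= \lim_{x\to\infty}\sqrt{x}\sqrt{1+2/x}\,\frac{1}{\sqrt{x}\sqrt{1+1/x}+\sqrt{x}}\\
&= \lim_{x\to\infty}\sqrt{x}\sqrt{1+2/x}\,\frac{1}{\sqrt{x}\left(\sqrt{1+1/x}+1\right)}\\
&= \lim_{x\to\infty}\sqrt{1+2/x}\,\frac{1}{\sqrt{1+1/x}+1}\\
&= 1\cdot\frac{1}{1+1} = \frac{1}{2}.
\end{aligned}$$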
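
The conjugate trick used in problems 11 to 16 also matters when such expressions are evaluated on a computer. The sketch below, with function names chosen here for illustration, evaluates the expression of problem 15 both directly and in the rewritten form; the direct form subtracts two nearly equal square roots and loses accuracy for large $x$, while the rewritten form does not.

```python
import math

def naive(x):
    # x [ sqrt(x^2 + 2) - sqrt(x^2 + 1) ], evaluated as written.
    return x * (math.sqrt(x * x + 2.0) - math.sqrt(x * x + 1.0))

def conjugate(x):
    # Same quantity after multiplying and dividing by the conjugate:
    # x / ( sqrt(x^2 + 2) + sqrt(x^2 + 1) ).
    return x / (math.sqrt(x * x + 2.0) + math.sqrt(x * x + 1.0))

for x in (1e2, 1e4, 1e6, 1e8):
    print(x, naive(x), conjugate(x))
```

For moderate $x$ the two columns agree; for very large $x$ only the second column can be trusted, and it settles near the limit $1/2$.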
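17. $\lim_{\theta\to 0}\frac{\sin(\theta^2)}{\theta^2} = \lim_{y\to 0}\frac{\sin y}{y} = 1$, on setting
$$y = \theta^2$$
and noting that $y \to 0$ as $\theta \to 0$.

18. $\lim_{\theta\to 0}\frac{\sin^2\theta}{\theta^2} = \lim_{\theta\to 0}\left(\frac{\sin\theta}{\theta}\right)^2 = 1^2 = 1.$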


19. $\lim_{\theta\to\pi/6}\frac{\sin(\theta-\pi/6)}{\theta-\pi/6} = \lim_{x\to 0}\frac{\sin x}{x} = 1$, on setting
$$x = \theta - \pi/6$$
and noting that this $\to 0$ as $\theta \to \pi/6$.

20. $\lim_{x\to 0} x^2 1_{\mathbb{Q}}(x) = 0$ from the ‘squeeze’ theorem on using
$$0 \le |x^2 1_{\mathbb{Q}}(x)| \le x^2 \to 0 \quad\text{as } x \to 0.$$

21. $\lim_{x\to 0} x(1-x)1_{\mathbb{Q}}(x) = 0$ from the ‘squeeze’ theorem on using
$$|x(1-x)1_{\mathbb{Q}}(x)| \le |x(1-x)| \to 0 \quad\text{as } x \to 0.$$

22. $\lim_{x\to 1} x(1-x)1_{\mathbb{Q}}(x) = 0$ from the ‘squeeze’ theorem on using
$$|x(1-x)1_{\mathbb{Q}}(x)| \le |x(1-x)| \to 0 \quad\text{as } x \to 1.$$

23. Explain why $\lim_{x\to 3} x(x-1)1_{\mathbb{Q}}(x)$ does not exist.

Sol: Near $x = 3$ the supremum of the values $x(x-1)1_{\mathbb{Q}}(x)$ is around $3(3-1) = 6$ (actually, more than this, because if $x > 3$, with $x$ rational, then $x(x-1)1_{\mathbb{Q}}(x) = x(x-1) > 3(3-1)$), whereas the inf is 0 on taking $x$ irrational.

24. Explain why $\lim_{x\to\infty}\cos x$ does not exist.

Sol: $\cos x$ oscillates between 1 and $-1$ as $x$ runs from any integer multiple of $2\pi$ (such as $0, 2\pi, 4\pi, 6\pi, \ldots$) to the next higher such multiple. So for any positive real number $t$ we have
$$\sup_{x\in(t,\infty)}\cos x = 1, \quad\text{and}\quad \inf_{x\in(t,\infty)}\cos x = -1.$$
Since there is no unique value lying between 1 and $-1$, the limit $\lim_{x\to\infty}\cos x$ does not exist.

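25. Explain why $\lim_{x\to\infty} x\sin x$ does not exist.

Sol: $\sin x$ oscillates between 1 and $-1$ as $x$ runs from any integer multiple of $2\pi$ (such as $0, 2\pi, 4\pi, 6\pi, \ldots$) to the next higher such multiple, and so $x\sin x$ takes arbitrarily large positive and arbitrarily large negative values on $(t,\infty)$. So for any positive real number $t$ we have
$$\sup_{x\in(t,\infty)} x\sin x = \infty, \quad\text{and}\quad \inf_{x\in(t,\infty)} x\sin x = -\infty.$$
Since there is no unique value lying between $\infty$ and $-\infty$, the limit $\lim_{x\to\infty} x\sin x$ does not exist.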


26. Explain why $\lim_{x\to\infty}\frac{\sin x}{x} = 0$.

Sol: This follows, for instance, by the ‘squeeze’ theorem on observing that
$$\left|\frac{\sin x}{x}\right| \le \frac{1}{x} \to 0 \quad\text{as } x \to \infty.$$

27. Explain why $\lim_{x\to\infty}\frac{\sin x}{\sqrt{x}} = 0$.

Sol: This follows, for instance, by the ‘squeeze’ theorem on observing that
$$\left|\frac{\sin x}{\sqrt{x}}\right| \le \frac{1}{\sqrt{x}} \to 0 \quad\text{as } x \to \infty.$$

Solutions for Exercise Set

1. For the set
$$S = [-\infty,-1) \cup (1,2] \cup \{6,8\} \cup [9,\infty]$$
write down
(i) an interior point: $-2$
(ii) a limit point: $-5$
(iii) a boundary point: $-1$
(iv) an isolated point: 6
(v) the interior $S^\circ = [-\infty,-1) \cup (1,2) \cup (9,\infty]$
(vi) the boundary $\partial S =$

Sol: $\partial S = \{-1, 1, 2, 6, 8, 9\}$. Note that $-\infty$ and $\infty$ are interior points of $S$.


2. Answer and explain briefly: 


(i) If $4 < \sup T$ is 4 an upper bound of $T$?
(ii) In (i), is there a point of $T$ that is $> 4$?
(iii) If $\inf T < 3$ is 3 a lower bound of $T$?
Sol: Since $\inf T$ is the greatest lower bound of $T$, it follows that 3, being $> \inf T$, is not a lower bound of $T$.
(iv) In (iii), is there a point of $T$ that is $< 3$?
Sol: Since 3 is not a lower bound of $T$ there must be a point of $T$ that is $< 3$.


3. Answer the following concerning limits, with brief explanations: 


(i) If $\lim_{x\to 1} F(x) = 2$ does it follow that $F(1) = 2$?

Sol: No, the definition of the limit $\lim_{x\to 1} F(x)$ contains no information about the value of $F$ at 1. For example,
$$F(x) = \begin{cases} 2 & \text{for all } x \ne 1;\\ 0 & \text{if } x = 1,\end{cases}$$
has $\lim_{x\to 1} F(x) = 2$ but $F(1) = 0$.

(ii) If $g$ is continuous at 3 is $g$ differentiable at 3?

Sol: No, a function can be continuous at a point without being differentiable at that point. For example, the function $g$ given by
$$g(x) = |x-3| \quad\text{for all } x \in \mathbb{R},$$
is continuous everywhere but is not differentiable at 3.

(iii) If $g$ is differentiable at 5 is $g$ continuous at 5?

Sol: Yes, we have proved that if a function is differentiable at a point then it is continuous at that point.

(iv) If $h'(5) = 4$ and $h(5) = 8$ then $\lim_{x\to 5} h(x) =$

Sol: Since $h'(5) = 4$ the function $h$ is differentiable at 5. Therefore it is continuous at 5. Hence $\lim_{x\to 5} h(x)$ is equal to $h(5)$, which is given to be 8. Thus, $\lim_{x\to 5} h(x) = 8$.

(v) If $H'(2) = 5$ and $H(2) = 3$ then $\lim_{w\to 2}\frac{H(w)-3}{w-2} =$

Sol: The limit here is
$$\lim_{w\to 2}\frac{H(w)-3}{w-2} = \lim_{w\to 2}\frac{H(w)-H(2)}{w-2}.$$
We recognize this to be the derivative $H'(2)$, which is given to be 5. Thus the value of the limit is 5.

(vi) If $G'(5) = 1$ and $G(5) = 6$ then $\lim_{y\to 5}\frac{G(y)-6}{y-5} =$

Sol: The limit here is
$$\lim_{y\to 5}\frac{G(y)-6}{y-5} = \lim_{y\to 5}\frac{G(y)-G(5)}{y-5}.$$
We recognize this to be the derivative $G'(5)$, which is given to be 1. Thus the value of the limit is 1.
(vii) $\lim_{x\to 0}\frac{\sin x}{x} =$

Sol: This is 1.

(viii) $\lim_{x\to\pi/3}\frac{\sin x - \sin(\pi/3)}{x - \pi/3} =$

Sol: We recognize this limit as the derivative of $\sin$ at $\pi/3$, and so its value is $\sin'(\pi/3) = \cos(\pi/3) = 1/2$.

(ix) If $G'(3) = 4$ then
$$\lim_{h\to 0}\frac{G(3+h)-G(3)}{h} =$$

Sol: We recognize this limit as the derivative of $G$ at 3 and so its value is $G'(3) = 4$, as given.


4. Work out the following derivatives: 
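(i) $\dfrac{d\sqrt{w^4-2w^2+4}}{dw}$

Sol: Using the chain rule we have
$$\frac{d\sqrt{w^4-2w^2+4}}{dw} = \frac{1}{2\sqrt{w^4-2w^2+4}}\,(4w^3-4w+0).$$

(ii) $\dfrac{d\left[(1+\sqrt{y})\tan y\right]}{dy}$

Sol: Using the product rule we have
$$\begin{aligned}
\frac{d\left[(1+\sqrt{y})\tan y\right]}{dy} &= \frac{d(1+\sqrt{y})}{dy}\tan y + (1+\sqrt{y})\frac{d\tan y}{dy}\\
&= \left(0+\frac{1}{2\sqrt{y}}\right)\tan y + (1+\sqrt{y})\sec^2 y.
\end{aligned}$$

(iii) $\dfrac{d}{dy}\left[\dfrac{1+\sqrt{y}}{\tan y}\right]$

Sol: Using the quotient rule we have
$$\begin{aligned}
\frac{d}{dy}\left[\frac{1+\sqrt{y}}{\tan y}\right] &= \frac{\tan y\,\frac{d(1+\sqrt{y})}{dy} - (1+\sqrt{y})\frac{d\tan y}{dy}}{\tan^2 y}\\
&= \frac{\tan y\left(0+\frac{1}{2\sqrt{y}}\right) - (1+\sqrt{y})\sec^2 y}{\tan^2 y}\\
&= \frac{(\tan y)\frac{1}{2\sqrt{y}} - (1+\sqrt{y})\sec^2 y}{\tan^2 y}.
\end{aligned}$$

(iv) $\dfrac{d\cot x}{dx}$

Sol: Using the quotient rule we have
$$\begin{aligned}
\frac{d\cot x}{dx} &= \frac{d}{dx}\,\frac{\cos x}{\sin x}\\
&= \frac{\sin x\cdot(-\sin x) - \cos x\cdot\cos x}{\sin^2 x}\\
&= \frac{-(\sin^2 x+\cos^2 x)}{\sin^2 x}\\
&= -\frac{1}{\sin^2 x}\\
&= -\csc^2 x.
\end{aligned}$$

(v) $\dfrac{d\sin\left(\cos\left(\tan\left(\sqrt{x}\right)\right)\right)}{dx}$

Sol: Using the chain rule repeatedly we have
$$\frac{d\sin\left(\cos\left(\tan\left(\sqrt{x}\right)\right)\right)}{dx} = \cos\left(\cos\left(\tan\left(\sqrt{x}\right)\right)\right)\cdot\left[-\sin\left(\tan\left(\sqrt{x}\right)\right)\right]\cdot\sec^2\left(\sqrt{x}\right)\cdot\frac{1}{2\sqrt{x}}.$$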
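5. Using the definition of the derivative, show that
$$\frac{d}{dx}\,\frac{1}{\sqrt{x}} = -\frac{1}{2x\sqrt{x}}.$$

Sol:
$$\begin{aligned}
\lim_{w\to x}\frac{\frac{1}{\sqrt{w}}-\frac{1}{\sqrt{x}}}{w-x} &= \lim_{w\to x}\frac{\sqrt{x}-\sqrt{w}}{(w-x)\sqrt{w}\sqrt{x}}\\
&= \lim_{w\to x}\frac{(\sqrt{x}-\sqrt{w})(\sqrt{x}+\sqrt{w})}{(w-x)\sqrt{w}\sqrt{x}(\sqrt{x}+\sqrt{w})}\\
&= \lim_{w\to x}\frac{(\sqrt{x})^2-(\sqrt{w})^2}{(w-x)\sqrt{w}\sqrt{x}(\sqrt{x}+\sqrt{w})}\\
&= \lim_{w\to x}\frac{x-w}{(w-x)\sqrt{w}\sqrt{x}(\sqrt{x}+\sqrt{w})}\\
&= \lim_{w\to x}\frac{-1}{\sqrt{w}\sqrt{x}(\sqrt{x}+\sqrt{w})}\\
&= -\frac{1}{\sqrt{x}\sqrt{x}\,2\sqrt{x}}\\
&= -\frac{1}{2x\sqrt{x}}.
\end{aligned} \tag{34.1}$$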
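
The limit in problem 5 can be watched numerically. The sketch below, with names chosen here for illustration, computes the difference quotient of $1/\sqrt{x}$ at the sample point $x = 4$ for shrinking gaps $w - x$ and compares it with $-1/(2x\sqrt{x})$, which at $x = 4$ equals $-1/16$.

```python
import math

def f(x):
    return 1.0 / math.sqrt(x)

def difference_quotient(x, w):
    # (f(w) - f(x)) / (w - x), the quantity whose limit as w -> x is the derivative.
    return (f(w) - f(x)) / (w - x)

def claimed_derivative(x):
    # -1 / (2 x sqrt(x))
    return -1.0 / (2.0 * x * math.sqrt(x))

x = 4.0  # hypothetical sample point
for gap in (1e-1, 1e-3, 1e-5):
    print(gap, difference_quotient(x, x + gap), claimed_derivative(x))
```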
Solutions for Exercise Set 


1. Find the maximum and minimum values of $x^2$ for $x \in [-1,2]$.

Sol: The derivative of $f(x) = x^2$ is $2x$, which is 0 at $x = 0$. Of the values $f(0) = 0$, $f(-1) = 1$, and $f(2) = 4$, the minimum value is 0 and the maximum is 4.

2. Find the maximum and minimum values of $x(6-x)(3-x)$ for $x \in [0,2]$.

Sol: For $f(x) = x(6-x)(3-x) = x^3 - 9x^2 + 18x$, we have $f'(x) = 3x^2 - 18x + 18$, and this is 0 when $x^2 - 6x + 6 = 0$, the solutions of which are $3 \pm \sqrt{3}$ (instead of using the formula for solutions of quadratic equations it is easier to observe that $x^2 - 6x + 6 = x^2 - 2\cdot 3x + 9 - 3$, so that the equation is $(x-3)^2 = 3$). Now $3 + \sqrt{3}$ is not in $[0,2]$ and $3 - \sqrt{3}$ is in $[0,2]$. We compare the values $f(0) = 0$, $f(2) = 8$, and $f(3-\sqrt{3}) = (3-\sqrt{3})(3+\sqrt{3})\sqrt{3} = 6\sqrt{3} \approx 10.39$. So the maximum of $f$ on $[0,2]$ is $6\sqrt{3}$ at $x = 3 - \sqrt{3}$, and the minimum is 0 at $x = 0$.
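
The procedure in problem 2 (compare the values at the endpoints and at the interior zeros of the derivative) can be written out as a short sketch. The variable names are chosen here for illustration; the critical points $3 \pm \sqrt{3}$ are entered by hand rather than found by the program.

```python
import math

def f(x):
    return x * (6.0 - x) * (3.0 - x)

lo, hi = 0.0, 2.0
# Zeros of f'(x) = 3x^2 - 18x + 18, i.e. solutions of (x - 3)^2 = 3.
critical = [3.0 - math.sqrt(3.0), 3.0 + math.sqrt(3.0)]
# Candidates: the endpoints plus those critical points that lie in [lo, hi].
candidates = [lo, hi] + [c for c in critical if lo <= c <= hi]
values = {c: f(c) for c in candidates}
x_max = max(values, key=values.get)
x_min = min(values, key=values.get)
print(x_max, values[x_max], x_min, values[x_min])
```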


3. A wire of length 12 units is bent to form an isosceles triangle. What should the lengths of the sides of the triangle be to make its area maximum?

Sol: Let the sides be $x$, $x$, and $2y$ (it will be clear soon that it is better to take the third side to be $2y$ than $y$, the algebra being a bit easier). Then $2x + 2y = 12$, so $x + y = 6$. The minimum possible value of $y$ is 0 (the triangle collapses into a line then) and the maximum value of $y$ is 3 (again the triangle collapses into a flat line). Thus $y \in [0,3]$. The height $h$ of the triangle satisfies $x^2 = y^2 + h^2$, so
$$h = \sqrt{x^2 - y^2}$$
and the area is
$$A = \frac{1}{2}\,2y\,h = y\sqrt{x^2-y^2} = y\sqrt{(6-y)^2 - y^2} = y\sqrt{36-12y}.$$
Since we have to work out the derivative of $A$ it will be a little easier, for algebra, to actually work on maximizing $A^2 = y^2(36-12y) = 36y^2 - 12y^3$ rather than $A$ (of course, then we can just take the square root at the end). The derivative is $(A^2)' = 72y - 36y^2 = 36y(2-y)$. This is 0 at $y = 0$ and at $y = 2$, both of these being in $[0,3]$. When $y = 0$ or $y = 3$ then $A$ is 0. So the maximum must occur when $y = 2$ and this value is $2\sqrt{36 - 12\cdot 2} = 4\sqrt{3}$. The sides are then $x = 4$, $x = 4$, and $2y = 4$, so the triangle of maximum area is equilateral.
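
A brute-force check of the triangle problem is sketched below: it evaluates $A(y) = y\sqrt{36-12y}$ on a fine grid of values of $y$ in $[0,3]$ and reports where the largest value occurs. The grid size is an arbitrary choice.

```python
import math

def area(y):
    # Isosceles triangle with sides x, x, 2y and x + y = 6.
    return y * math.sqrt(36.0 - 12.0 * y)

n = 3000  # number of grid steps (arbitrary)
grid = [3.0 * k / n for k in range(n + 1)]
best_y = max(grid, key=area)
print(best_y, area(best_y), 4.0 * math.sqrt(3.0))
```

A grid search only locates the maximum to within the grid spacing, so it complements the derivative argument rather than replacing it.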


4. A piece of wire is bent into a rectangle of maximum area. Show that this maximal area rectangle is a square.

Sol: Let $L$ be the length of wire, and suppose the rectangle has sides $x$ and $y$. Then the perimeter is $2x + 2y = L$, thus $x + y = L/2$. The area is $A = xy$. Thus the problem is to maximize the product of two numbers given that their sum is $L/2$. We write $A$ as $A = x(L/2 - x) = (L/2)x - x^2$. The restriction on $x$ is that it lies in $[0, L/2]$. We know that the maximum value of a quadratic can be obtained without calculus, but it is simple to work out the derivative $A' = L/2 - 2x$ and observe that this is 0 when $x = L/4$. Then $y = (L/2) - (L/4) = L/4$. Thus the rectangle has equal sides, which means that it is a square. This confirms intuition.


5. A piece of wire of length $L$ is cut into pieces of length $x$ and $L - x$ (including the possibility that $x$ is 0 or $L$), and each piece is bent into a circle. What is the value of $x$ which would make the total area enclosed by the pieces maximum, and what is the value of $x$ which would make this area minimum?

Sol: If a length $x$ is bent into a circle the radius of the circle is $x/(2\pi)$. The area is $\pi[x/(2\pi)]^2$. For the remaining length $L - x$ the area is $\pi[(L-x)/(2\pi)]^2$. Thus the total area is
$$A(x) = \pi[x/(2\pi)]^2 + \pi[(L-x)/(2\pi)]^2 = \frac{x^2}{4\pi} + \frac{(L-x)^2}{4\pi}.$$
The value $x$ lies in $[0, L]$. The derivative of $A$ is
$$A'(x) = \frac{1}{4\pi}2x + \frac{1}{4\pi}2(L-x)(-1) = \frac{1}{2\pi}(2x - L).$$
This is 0 when $x = L/2$, that is, with the wire split into two equal pieces. The area enclosed is then
$$A(L/2) = \frac{(L/2)^2}{4\pi} + \frac{(L/2)^2}{4\pi} = \frac{L^2}{8\pi}.$$
The end point values for $x$ are 0 and $L$, corresponding to rolling the wire up into just one large circle of radius $L/(2\pi)$; its area is $A(0) = \frac{1}{4\pi}L^2$. Thus the minimum area is obtained by splitting the wire into two equal pieces, each rolled into a circle, and the maximum area is obtained by rolling up the entire length of wire into a circle.
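
The conclusion of the two-circle problem can be checked on a grid for one sample wire length. In the sketch below the value `wire_length = 10.0` and the function name are hypothetical choices for illustration.

```python
import math

def total_area(x, wire_length):
    # Two circles with circumferences x and wire_length - x.
    r1 = x / (2.0 * math.pi)
    r2 = (wire_length - x) / (2.0 * math.pi)
    return math.pi * r1 * r1 + math.pi * r2 * r2

wire_length = 10.0  # hypothetical value of L
n = 1000
grid = [wire_length * k / n for k in range(n + 1)]
x_min = min(grid, key=lambda x: total_area(x, wire_length))
x_max = max(grid, key=lambda x: total_area(x, wire_length))
print(x_min, total_area(x_min, wire_length))  # smallest total area on the grid
print(x_max, total_area(x_max, wire_length))  # largest total area on the grid
```

The smallest area on the grid occurs at the midpoint and the largest at an end point; the two end points give the same area, so the search simply reports the first one it meets.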


6. Here are some practice problems on straight lines and distances:

(i) Work out the distance from $(1,2)$ to the line $3x = 4y + 5$.

(ii) Work out the distance from $(2,-2)$ to the line $4x - 3y - 5 = 0$.

(iii) Find the point $P_0$ on the line $L$, with equation $3x + 4y - 7 = 0$, closest to the point $P(0,3)$. What is the angle between $P_0P$ and the line $L$?

(iv) Let $P_0$ be the point on the line $L$, with equation $3x + 4y - 11 = 0$, closest to the point $P(1,3)$. What is the slope of the line $P_0P$?

(v) Let $P_0$ be the point on the line $L$, with equation $3x + 4y - 11 = 0$, closest to the point $P(1,3)$. Find the equation of the line through $P$ and $P_0$.


Solutions for Exercise Set 


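(a) Work out the following integrals using substitutions:

i. Use
$$u = 4 - 3x \quad\text{and}\quad du = -3\,dx.$$
So then
$$dx = -\frac{1}{3}\,du$$
and
$$\begin{aligned}
\int (4-3x)^2\,dx &= \int u^2\left(-\frac{1}{3}\,du\right)\\
&= -\frac{1}{3}\int u^2\,du\\
&= -\frac{1}{3}\,\frac{u^{2+1}}{2+1} + C\\
&= -\frac{1}{9}(4-3x)^3 + C.
\end{aligned} \tag{34.2}$$

ii. Use
$$y = 2 + 5x \quad\text{and}\quad dy = 5\,dx.$$
Then
$$\begin{aligned}
\int \sqrt{2+5x}\,dx &= \int \sqrt{y}\,\frac{1}{5}\,dy\\
&= \frac{1}{5}\int y^{1/2}\,dy\\
&= \frac{1}{5}\,\frac{y^{\frac{1}{2}+1}}{\frac{1}{2}+1} + C\\
&= \frac{1}{5}\cdot\frac{2}{3}\,y^{3/2} + C\\
&= \frac{2}{15}(2+5x)^{3/2} + C.
\end{aligned} \tag{34.3}$$

iii. Use
$$w = 2 - 3x \quad\text{and}\quad dw = -3\,dx.$$
So
$$dx = -\frac{1}{3}\,dw.$$
Then
$$\begin{aligned}
\int \frac{1}{\sqrt{2-3x}}\,dx &= \int \frac{1}{\sqrt{w}}\left(-\frac{1}{3}\right)dw\\
&= -\frac{1}{3}\int w^{-1/2}\,dw\\
&= -\frac{1}{3}\,\frac{w^{-\frac{1}{2}+1}}{-\frac{1}{2}+1} + C\\
&= -\frac{2}{3}\,w^{1/2} + C\\
&= -\frac{2}{3}(2-3x)^{1/2} + C.
\end{aligned} \tag{34.4}$$

iv. Use
$$y = 3 - 2x.$$
Then
$$dy = -2\,dx \quad\text{and}\quad dx = -\frac{1}{2}\,dy.$$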
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4/441 ai 
5 9/5 5 7/5 
= tC 
for 56" 
5 9/5 7/5 
ene 2 2 
Ti gyi? + 6 (3 ey? +C 
(34.5) 
v. Use the substitution
$$y = 2 + 5x.$$
Then
$$dy = 5\,dx,$$
and
$$x = \frac{1}{5}(y-2).$$
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So 


xL Liy—2) 1 
| orem | ae 5 


ee ne 
=, 


Yy 2 
ysl _ ysl 


Sle Sle Sle Sle Sle Sle 


Fae ‘8s 
Iw) 
~~ 
ot 
Q 
< 
| 
bo 

Sx 
Neng 

do 

a 

or 

Q 

Neng 
[ cee 


+ 
rae 
| 
ouw 
+ 
re 


< 
~ 
~ 
ot 
On 
Ned 
tw 
ae 
ot 
(eaeeales 
+ 
eS) 


I 
SL ee _ eS 


NI] Ot NI] Ot ours 


(2452) — 5(2 + 50) +C. 
(34.6) 


vi. scree — = dz Use y = x? +245, for which dy = (2x + 1)dzr 
sae Uh, 30 “the integral becomes 


[= soy VU C= WP HEHE +C. 


6 ey) 
vii. fe @/24 dx 


viii. Use $y = -x^2/2$, for which
$$dy = -x\,dx$$
and so
$$\int x e^{-x^2/2}\,dx = \int e^{y}(-dy) = -\int e^{y}\,dy = -e^{y} + C = -e^{-x^2/2} + C.$$


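ix. $\displaystyle\int \frac{\sqrt{\log(2x+5)}}{2x+5}\,dx$

Use $y = \log(2x+5)$, for which
$$dy = \frac{2}{2x+5}\,dx$$
and then
$$\int \frac{\sqrt{\log(2x+5)}}{2x+5}\,dx = \int \sqrt{y}\,\frac{1}{2}\,dy = \frac{1}{2}\int y^{1/2}\,dy$$
and this equals $\frac{1}{2}\cdot\frac{2}{3}\,y^{3/2} + C$ and so
$$\int \frac{\sqrt{\log(2x+5)}}{2x+5}\,dx = \frac{1}{3}\left[\log(2x+5)\right]^{3/2} + C.$$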

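x. $\displaystyle\int \frac{1}{x\,\log x\,\log(\log x)}\,dx$

Using $y = \log x$ converts the integral to
$$\int \frac{1}{y\log y}\,dy$$
and then $w = \log y$ converts this to $\int \frac{1}{w}\,dw = \log w + C$, and so
$$\int \frac{1}{x\,\log x\,\log(\log x)}\,dx = \log\log\log x + C.$$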


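xi.
$$\begin{aligned}
\int \sin(5x)\cos(2x)\,dx &= \int \frac{1}{2}\left[\sin(5x+2x) + \sin(5x-2x)\right]dx\\
&= \frac{1}{2}\left[-\frac{1}{7}\cos(7x) - \frac{1}{3}\cos 3x\right] + C.
\end{aligned} \tag{34.7}$$

xii.
$$\begin{aligned}
\int \sin(5x)\sin(2x)\,dx &= \int \frac{1}{2}\left[\cos(5x-2x) - \cos(5x+2x)\right]dx\\
&= \frac{1}{2}\left[\frac{1}{3}\sin 3x - \frac{1}{7}\sin 7x\right] + C.
\end{aligned} \tag{34.8}$$

xiii. $\displaystyle\int \cos(5x)\cos(2x)\,dx = \frac{1}{2}\left[\frac{1}{7}\sin 7x + \frac{1}{3}\sin 3x\right] + C$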
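xiv. $\displaystyle\int \sin^3 x\,dx$
$$\begin{aligned}
\sin^3 x &= \sin x\,\sin x\,\sin x\\
&= \frac{1}{2}\left[\cos 0 - \cos(2x)\right]\sin x\\
&= \frac{1}{2}\left[1 - \cos 2x\right]\sin x\\
&= \frac{1}{2}\left[\sin x - \sin x\cos 2x\right]\\
&= \frac{1}{2}\left\{\sin x - \frac{1}{2}\left[\sin 3x + \sin(-x)\right]\right\}\\
&= \frac{1}{2}\sin x - \frac{1}{4}\left[\sin 3x - \sin x\right]\\
&= \frac{1}{2}\sin x + \frac{1}{4}\sin x - \frac{1}{4}\sin 3x\\
&= \frac{3}{4}\sin x - \frac{1}{4}\sin 3x.
\end{aligned} \tag{34.9}$$
Integration gives
$$\int \sin^3 x\,dx = -\frac{3}{4}\cos x + \frac{1}{12}\cos 3x + C.$$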

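xv. $\displaystyle\int \cos^3 x\,dx$
$$\begin{aligned}
\cos^3 x &= \cos x\,\cos x\,\cos x\\
&= \frac{1}{2}\left[\cos 0 + \cos(2x)\right]\cos x\\
&= \frac{1}{2}\left[1 + \cos 2x\right]\cos x\\
&= \frac{1}{2}\left[\cos x + \cos x\cos 2x\right]\\
&= \frac{1}{2}\left\{\cos x + \frac{1}{2}\left[\cos 3x + \cos(-x)\right]\right\}\\
&= \frac{1}{2}\cos x + \frac{1}{4}\left[\cos 3x + \cos x\right]\\
&= \frac{3}{4}\cos x + \frac{1}{4}\cos 3x.
\end{aligned} \tag{34.10}$$
Integration gives
$$\int \cos^3 x\,dx = \frac{3}{4}\sin x + \frac{1}{12}\sin 3x + C.$$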
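
One way to double-check antiderivatives such as those for $\sin^3 x$ and $\cos^3 x$ is to differentiate them numerically and compare with the integrand. The sketch below does this with a central difference quotient at a few sample points; the function names, the sample points, and the step size are choices made here for illustration.

```python
import math

def F_sin3(x):
    # Candidate antiderivative of sin^3 x: -(3/4) cos x + (1/12) cos 3x
    return -0.75 * math.cos(x) + math.cos(3.0 * x) / 12.0

def F_cos3(x):
    # Candidate antiderivative of cos^3 x: (3/4) sin x + (1/12) sin 3x
    return 0.75 * math.sin(x) + math.sin(3.0 * x) / 12.0

def central_difference(F, x, h=1e-5):
    # Symmetric difference quotient approximating F'(x).
    return (F(x + h) - F(x - h)) / (2.0 * h)

samples = (0.3, 1.0, 2.5, 4.0)
err_sin3 = max(abs(central_difference(F_sin3, x) - math.sin(x) ** 3) for x in samples)
err_cos3 = max(abs(central_difference(F_cos3, x) - math.cos(x) ** 3) for x in samples)
print(err_sin3, err_cos3)
```

Both printed errors should be tiny, at the level of the approximation error of the difference quotient.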


xvi.
$$\int \sin^2(5x)\,dx = \int \sin(5x)\sin(5x)\,dx = \int \frac{1}{2}\left[\cos 0 - \cos(5x+5x)\right]dx$$
and so
$$\int \sin^2(5x)\,dx = \frac{1}{2}\int\left[1 - \cos(10x)\right]dx = \frac{1}{2}\left[x - \frac{1}{10}\sin 10x\right] + C.$$


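xvii. $\displaystyle\int \sqrt{3 - 6x - x^2}\,dx$

Completing the square for $x^2 + 6x - 3$ we have
$$x^2 + 6x - 3 = x^2 + 2\cdot 3\cdot x + 3^2 - 3^2 - 3 = (x+3)^2 - 12$$
and so the integral is
$$\int \sqrt{12 - (x+3)^2}\,dx.$$
Substitute
$$x + 3 = \sqrt{12}\sin\theta$$
for which
$$dx = \sqrt{12}\cos\theta\,d\theta$$
and so
$$\begin{aligned}
\int \sqrt{12 - (x+3)^2}\,dx &= \int \sqrt{12 - 12\sin^2\theta}\,\sqrt{12}\cos\theta\,d\theta\\
&= \sqrt{12}\int \sqrt{12(1-\sin^2\theta)}\,\cos\theta\,d\theta\\
&= 12\int \cos\theta\cos\theta\,d\theta\\
&= 12\int \cos^2\theta\,d\theta = 12\int \frac{1}{2}\left[\cos 0 + \cos(2\theta)\right]d\theta\\
&= 6\int (1 + \cos 2\theta)\,d\theta\\
&= 6\left[\theta + \frac{1}{2}\sin 2\theta\right] + C\\
&= 6\left[\theta + \sin\theta\cos\theta\right] + C.
\end{aligned} \tag{34.11}$$
Now substitute back in
$$\sin\theta = \frac{x+3}{\sqrt{12}} \quad\text{and}\quad \cos\theta = \frac{\sqrt{12 - (x+3)^2}}{\sqrt{12}}$$
to get
$$\int \sqrt{12 - (x+3)^2}\,dx = 6\left[\sin^{-1}\left(\frac{x+3}{\sqrt{12}}\right) + \frac{x+3}{12}\sqrt{12 - (x+3)^2}\right] + C.$$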
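
Since signs are easy to lose in a computation like xvii, here is a numerical check, offered as a sketch with names chosen for illustration. It differentiates the antiderivative $6\left[\sin^{-1}\left(\frac{x+3}{\sqrt{12}}\right) + \frac{x+3}{12}\sqrt{12-(x+3)^2}\right]$, where the plus sign comes from $\cos^2\theta = \frac{1}{2}(1 + \cos 2\theta)$, and compares the result with $\sqrt{3 - 6x - x^2}$ at a few points inside the interval where the integrand is defined.

```python
import math

def integrand(x):
    return math.sqrt(3.0 - 6.0 * x - x * x)

def antiderivative(x):
    # 6 [ asin(u / sqrt(12)) + u sqrt(12 - u^2) / 12 ] with u = x + 3.
    u = x + 3.0
    return 6.0 * math.asin(u / math.sqrt(12.0)) + 0.5 * u * math.sqrt(12.0 - u * u)

def central_difference(F, x, h=1e-5):
    # Symmetric difference quotient approximating F'(x).
    return (F(x + h) - F(x - h)) / (2.0 * h)

samples = (-5.0, -3.0, -1.0, 0.0)  # points with 12 - (x + 3)^2 > 0
worst = max(abs(central_difference(antiderivative, x) - integrand(x)) for x in samples)
print(worst)
```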
oe 1 
XViil. J aS da 